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Preface to the second edition 


The following are the main features of the second edition. 


More than 150 new problems and examples have been added. 
The new problems include several that relate abstract concepts 
to concrete situations. Among others, we present applications 
of G-sets, the division algorithm and greatest common divisors 
in a given euclidean domain. In particular, we should mention 
the combinatorial applications of the Burnside theorem to real- 
life problems. A proof for the constructibility of a regular n-gon 
has been included in Chapter 18. 

We have included a recent elegant and elementary proof, due to 
Osofsky, of the celebrated Nocther-Lasker theorem. 

Chapter 22 on tensor products with an introduction to categories 
and functors 1s a new addition to Part IV. This chapter provides 
basic results on tensor products that are useful and important 
in present-day mathematics. 


We are pleased to thank all of the professors and students in the many 
universities who used this textbook during the past seven years and contri- 
buted their useful feedback. In particular, we would like to thank Sergio 
R. Lopez-Permouth for his help during the time when the revised edition 
was being prepared. Finally, we would like to acknowledge the staff of 
Cambridge University Press for their help in bringing out this second 
edition so efficiently. 


P. B. Bhattacharya 
S. K. Jain 
S. R. Nagpaul 
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Preface to the first edition 


This book is intended for seniors and beginning graduate students. It is 
self-contained and covers the topics usually taught at this level. 

The book is divided into five parts (see diagram). Part I (Chapters 1-3) 
is a prerequisite for the rest of the book. It contains an informal introduc- 
tion to sets, number systems, matrices, and determinants. Results proved 
in Chapter | include the Schroder-Bernstein theorem and the cardinality 
of the set of real numbers. In Chapter 2, starting from the well-ordering 
principle of natural numbers, some important algebraic properties of 
integers have been proved. Chapter 3 deals with matrices and determinants. 
It is expected that students would already be familiar with most of the 
material in Part I before reaching their senior year. Therefore, it can be 
completed rapidly, skipped altogether, or simply referred to as necessary. 

Part II (Chapters 4-8) deals with groups. Chapters 4 and 5 provide a 
foundation in the basic concepts in groups, including G-sets and their 
applications. Normal series, solvable groups, and the Jordan-Holder 
theorem are given in Chapter 6. The simplicity of the alternating group 
A, and the nonsolvability of S,,n > 4, are proved in Chapter 7. Chapter 8 
contains the theorem on the decomposition of a finitely generated abelian 
group as a direct sum of cyclic groups, and the Sylow theorems. The 
invariants of a finite abelian group and the structure of groups of orders 
p’, pq, where p, q are primes, are given as applications. 

Part III (Chapters 9-14) deals with rings and modules. Chapters 9-11 
cover the basic concepts of rings, illustrated by numerous examples, 
including prime ideals, maximal ideals, UFD, PID, and so forth. Chapter 
12 deals with the ring of fractions of a commutative ring with respect to 
a multiplicative set. Chapter 13 contains a systematic development of 
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Xvi Preface to the first edition 


integers, starting from Peano's axioms. Chapter 14 is an introduction to 
modules and vector spaces. Topics discussed include completely reducible 
modules, free modules, and rank. 

Part 1V (Chapters 15-18) is concerned with field theory. Chapters 15 
and 16 contain the usual material on algebraic extensions, including 
existence and uniqueness of algebraic closure, and normal and separable 
extensions. Chapter 17 gives the fundamental theorem of Galois theory 
and its application to the fundamental theorem of algebra. Chapter 18 
gives applications of Galois theory to some classical problems in algebra 
and geometry. 

Part V (Chapters 19-21) covers some additional topics not usually 
taught at the undergraduate level. Chapter 19 deals with modules with 
chain conditions leading to the Wedderburn—Artin theorem for semi- 
simple artinian rings. Chapter 20 deals with the rank of a matrix over a 
PID through Smith normal form. Chapter 21 gives the structure of a 
finitely generated module over a PID and its applications to linear algebra. 

Parts II and Hil are almost independent and may be studied in any 
order. Part IV requires a knowledge of portions to Parts Il and III. It 
can be studied after acquiring a basic knowledge of groups, rings, and 
vector spaces. The precise dependence of Part IV on the rest of the book 
can be found from the table of interdependence of chapters. 

The book can be used for a one-year course on abstract algerba. The 
material presented here is in fact somewhat more than most instructors 
would normally teach. Therefore, it provides flexibility in selection of the 
topics to be taught. A two-quarter course in abstract algebra may cover 
the following: groups ~ Chapters 4, 5, and 7 (Section 1) and 8; rings - 
Chapters 9, 10, 11, and 14 (Sections 1-3); field theory - Chapters 15, 16, and 
18 (Section 5). A two-semester course in abstract algebra can cover all of 
the material in Parts II, III, and IV. 

Numerous examples have been worked out throughout the book to 
illustrate the concepts and to show the techniques of solving problems. 
There are also many challenging problems for the talented student. We 
have also provided solutions to the odd-numbered problems at the end 
of the book. We hope these will be used by students mostly for comparison 
with their own solutions. 

Numbering of theorems, lemmas, and examples is done afresh in each 
chapter by section. If reference is made to a result occurring in a previous 
chapter, then only the chapter number is mentioned alongside. In all cases 
the additional information needed to identify a reference is provided. 

The book has evolved from our experience in teaching algebra for many 
years at the undergraduate and graduate levels. The material has been 
class tested through mimeographed notes distributed to the students. 
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We acknowledge our indebtedness to numerous authors whose books 
have influenced our writing. In particular, we mention P. M. Cohn’s Algebra, 
Vols. 1, 2, John Wiley, New York, 1974, 1977, and S. Lang’s Algebra, 
Addison-Wesley, Reading, MA, 1965. 

During the preparation of this book we received valuable help from 
several colleagues and graduate students. We express our gratitude to all 
of them. We also express our gratefulness to Ohio University for providing 
us the facilities to work together in the congenial environment of its 
beautiful campus. 

It is our pleasant duty to express our gratitude to Professor Donald 
O. Norris, Chairman, Department of Mathematics. Ohio University, 
whose encouragement and unstinted support enabled us to complete our 
project. We also thank Mrs. Stephanie Goldsberry for the splendid job 
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{x € A|P(x)) 
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for all 

there exists 

is an element of 

is not an element of 

set of all x € A satisfying condition P(x) 
power set of x 

family indexed by set A 


union of (X)jiea 
intersection of (X))c, 


Cartesian product of X and Y 
empty set 

is a subset of 

is a proper subset of 

contains 

properly contains 

implies 

if and only if 

if and only if 

fis a map of X into Y 

image of x © X under f: X > Y 
y=f(x) where fi: X > Y,xEX,yEeY 
composition 

Euler’s function 


Glossary of symbols XIX 


Z/(n) or Z, 
|X| or card X 
|G| 

[S] 

C, 

Sr 


A, 


D, 
GL(m, F) 
Z(G) 

dq 

A/B 
{L:K] 


MS) (Nz(S)) 
C(S) (Cu(S)) 


in number theory. the greatest common divisor of a 
and 5; in rings and modules, the ideal or submodule 
generated by a and b 

a divides b 

a does not divide b 

Kronecker delta 

determinant 

+1, according as the permutation ¢ 1s even or odd 
square matrix with | in (i, 7) position, 0 elsewhere 
the set of positive integers (1,2,3,...,71) 

set of all natural numbers 

set of all integers 

set of all rational numbers 

set of all real numbers 

set of all complex numbers 

the cardinal of the continuum (cardinality of the reals) 
integers modulo n 

cardinality of X 

order of group G 

subgroup generated by S 

cyclic group of order n 

as a group, the symmetric group of degree n; as a ring, 
the ring of n X n matrices over S 

alternating group of degree n 

dihedral group of degree n 

group of invertible m X m matrices over F 

center of G 

is a normal subgroup of 

quotient group (ring, module) of A modulo B 

in groups, the index of a subgroup K in a group L; in 
vector spaces, the dimension of a vector space L over K; 
in fields, the degree of extension of L over K 
normalizer of S (in H) 

conjugate class of S (with respect to //) 


product of (X))iea 
direct sum of (X,)ie, 


image of homomorphism f{ 
kernel of homomorphism f{ 

is isomorphic into (embeddable) 
is isomorphic onto 


XX Glossary to Symbols 


Hom,(X, Y) 


Hom(X, Y) 
End(X) 


opposite ring of R 

ideal (submodule) generated by S 

right ideal generated by S 

left ideal generated by S 

sum of right or left ideals (submodules) (X;),<q 


polynomial ring over R in one indeterminate x 
polynomial ring over R in 7 indeterminates, x,,...,X, 
formal power series ring 

ring of formal Laurent series 

rationals between 0 and | of the form m/p", m,n > 0, 
under the binary operation “addition modulo |” 
set of all R-homomorphisms of R-module X to R- 
module Y 

set of all homomorphisms of X to Y 
endomorphisms of X 

automorphisms of X 

localization of a ring R at S 

subfield generated by F and a 

subring generated by F and § 

subfield generated by F and S 

Galois field (finite field) with g elements 

algebraic closure of F 

fixed field of H 

Galois group of automorphisms of E over F 
cyclotomic polynomial of degree n 

tensor product of Mz, and 2N 

end of the proof 
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Preliminaries 


CHAPTER 1! 


Sets and mappings 


1 Sets 


The concept of set is fundamental in mathematics. It is not our purpose to 
present here an axiomatic account of set theory. Instead we shall assume 
an intuitive understanding of the terms “‘set’”’ and ‘“‘belongs to.” Infor- 
mally speaking, we say that a set is a collection of objects {or elements). 

If Sis aset and x is an element of the set S, we say x belongs to S, and we 
write x € S. Anelement ofa set Sis also called a member of S. If.x does not 
belong to S, we write x ¢ S. 

Let A and B be sets. We say that 4 and B are equal, written A = B, if 
they consist of the same elements; that is, 


xEASXE B. 


(The symbol = stands for “‘ifand only if.”’) A set is thus determined by its 
elements. 

A set with a finite number of elements can be exhibited by writing all of 
its elements between braces and inserting commas between elements. 
Thus, {1,2,3} denotes the set whose elements are 1, 2, and 3. The order in 
which the elements are written makes no difference. Thus, {1,2,3} and 
(2,1,3} denote the same set. Also, repetition of an element has no effect. 
For example, ({1,2,3,2) is the same set as {1,2,3}. Given a set A and a 
statement P(x), there is a unique set B whose elements are precisely those 
elements x of A for which P(x) is true. In symbols, we write 
B = {x € A|P(x)}). When the context is clear, we sometimes also write B = 
{x|P(x)}. 

There are standard symbols for several sets that we frequently deal 
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with. Some of these are given below. Others will be introduced subse- 
quently as the occasion arises. 


N denotes the set of all natural numbers 1,2,3..... 

Z is the set of all integers 0,+ 1,+ 2..... 

Q is the set of all rational numbers - that is, fractions a/b, where 
a,b are integers and b # 0. 

R is the set of all real numbers. 

C is the set of all complex numbers x + /y, where x,y are real 
numbers and i? = — 1. 


For any positive integer 7, the set (1,2,...,7} is denoted by n. A set S 1s 
called finite if it has no elements, or if its elements can be listed (counted, 
labeled) by natural numbers 1,2,3,... and the process of listing stops at a 
certain number, say 7. The number zn is called the cardinality of S, and we 
write |S] = 1. A set whose elements cannot be listed by the natural num- 
bers 1,2,...,7 for any n whatsoever is called an infinite set. 

A set S is said to be empty if S has no elements; that is, the statement 
x € Sis not true for any x. If Sand 7 are both empty sets, then S = 7, 
since the condition x € S <> x € Tis satisfied because there is no element 
xin either Sor J to which the condition may be applied. (In such a case we 
say that the condition is satisfied vacuously.) Because any two empty sets 
are equal, there is just one empty set, which is denoted by ©. The empty 
set is also called the nui/ set or the void set. 


Definition. Let A and B be sets. A is called a subset of B if every element of 
A is an element of B; that is, if 


aE A>acB, 
(The symbol => stands for “implies.”’) 


If A is a subset of B, we wnte AC B. (Some authors write 4 C B.) 
Further, if A 1s a subset of B, we also say that B contains (or includes) A, 
and we write B D A (or B D A). 

It follows immediately from the definition that A and B are equal ifand 
onlyif A C Band BC A. Thus, every set is a subset of itself. Moreover, the 
empty set © is a subset of every set because the condition x € 0 > xE A 
is satisfied vacuously. 

If SC A, but S # A, then S is a proper subset of A written as S¢ A. 


Definition. Let A and B be subsets ofa set U. The union of A and B, written 
AUB, is the set 


AUB=({x€ UlxE A orxeE B}. 
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The intersection of A and B, written A‘ B, is the set 

ANB=(xE€ UlxE A and xc B}. 
The difference of A and B, written A — B, is the set 

A-B=({xE€ UlxE€ Aandx¢ B}. 


IfBC A, then A — B is called the complement of B in A. A and Bare said 
to be disjoint if A N B is empty (AN B= ©). 


For example, if A = (1,2,3) and B = (3,4,5}, then A U B = (1,2,3,4,5), 
AN B= (3), A — B= (1,2), and B — A = {4,5}. 


AUB ANB 


The term universal set is sometimes used for a set U that contains all 
sets in a given context; that is, Y¥ C U for every set X under consideration. 
The complement of X in U (namely, the set U —_X) is then simply called 
the complement of X and is written X’ without explicit reference to U. 

It is sometimes helpful to illustrate union, intersection, difference, and 
complement by means of Venn diagrams. We draw circles to represent 
the given sets A and B and enclose them within a rectangle representing 
the universal set U. The shaded area in each diagram represents the set 
A U B, and so forth, as indicated. 


1.1 Theorem. Let A, B, and C be sets. Then 


(t{) AVUA=A=ANA. 
(it) AUB=BUA;ANB=BNA. 
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(iti) (AUB)UC=AU(BUC),(ANB)NC=AN(BNC). 

(iv) AU(BNC)=(AUB)N(AUC), 
AN(BUC)=(ANB)U(ANC). 

(v) AU(ANB)=A=AN(AUB). 


Proof. Left as an exercise. O 


1.2 Theorem (DeMorgan’s rules). Let A, B, and X be sets. Then 


(t) X-—(X—-A)=XNA. 
(ti) X-—(AUB)=(X—A)N(X— B). 
(iti) X —(AQNB)=(X — A)U(X — B). 


Proof of (i): 


xEX—(X —A)exE€ X and x ¢ (X — A) 
—>xéEXandxEaA 
—xEXNA. 


Hence, X —(X —A)=XNA. O 


Proof of (ii): 


xEX—-(AUB)SxEXandx¢é (AUB) 
—xéENXandx¢éAandx¢ B 
<=> (x © X and x € A) and (x © X and x ¢ B) 
—xEX—-AandxEeX-—B 
= xE(X —A)N(X — B). 


Hence, Y — (A U B) = (X — A)N(X — B). 
The last part is proved similarly. O 


In view of the equality (A U B) UC = A U(B UC) (Theorem 1.1), we 
can do away with the parentheses and simply write A U B U C to denote 
unambiguously the union of the sets 4, B, and C. Moreover, it is clear that 
the set A U B U C consists of all those elements that belong to at least one 
of the sets A, B, and C. Likewise, A M BN C, the intersection of A, B, and 
C, is the set of those elements that belong to each of the sets A, B, and C. 
This suggests the following definition for the union and intersection of an 
arbitrary number of sets. 


Definition. Let S be a set whose elements are themselves sets. The union of 
all sets in S is defined to be the set 


{x|x € X for some X in S) 
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and is denoted by Uy_;X. The intersection ofall sets in S is defined to be 
the set 


(x|x © X for every X in S} 
and is denoted by NvesX. 


If S contains only a finite number of sets, say X,,...,X,, their union is 
written U_X,or X, U «++ U-X,, and their intersection is written (V_, X, 
or X,;N °: NX,. 


Definition. Let X bea set. The set of all subsets of X is called the power set 
of X and is denoted by P(X). That is, 
P(X) =(SISC X). 


Recall that the empty set O and the set_X itself are subsets of X and are 
therefore elements of A(X). For example, let _Y = (1,2}. Then the subsets 
of X are @, {1}, (2}, and X. Hence, 


P(X) = (O,(1},(2),X }. 


If X is the empty set ©, then P(X) has just one element: ©. 
It should be noted that a and (a) are not the same. If a €_X, then {a} € 
P(X). 


13 Theorem. Let X be a finite set having n elements. Then P(X ) has 
2" subsets. Consequently, |P(X )| = 2'41. 


Proof. Let us first consider those subsets of X that have r elements each, 
where 0 = rs 7. It is shown in high school algebra that the number of 
ways in which r elements can be selected out of 7 elements is 


(") _ en 
r n(n — r)’ 


which is therefore the number of subsets of X having r elements each. 
Hence, the total number of subsets of X is ="_,(”). On putting a = | inthe 
binomial expansion 


(+ar=> ("Jar 


r=0 


we get 


y (")=« +1 = 2", 


r= 
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This proves that X has exactly 2” subsets. Hence, ?(X) has 2” elements. 
Since n = |X|, we get |P(X)| = 2'*. O 


Incidentally, the equality |P(X)| = 2'*' explains why the set of all sub- 
sets of X is called the power set of X. 


Definition. Let X be a set. Let n be a set whose elements are nonempty 
subsets of X; that is, n C P(X) and O € nx. If the elements of x are pairwise 
disjoint and their union is X, then x is called a partition of X, and the 
elements of nx are called blocks of the partition 7. 


For example, the set z = {{1,2},(3},(4,5}} is a partition of the set X = 
(1,2,3,4,5). 
The next theorem follows immediately from the definition. 


1.4 Theorem. Let X be a set. Let x be a set whose elements are 
nonempty subsets of X. Then 7 is a partition of X ifand only if each element 
of X belongs to exactly one element of 7. 


Proof. Exercise. O 
Note that if X is empty, it has only the partition x = ©. 
Problems 


1. Prove Theorem 1.1. 

2. Prove Theorem |.2(ii1). 

3. Ifaset A has m elements and a set B has n elements, find the 
number of elements in A U B. Assume that 4A N Bhas k elements. 

4. After the registration of 100 freshmen, the following statistics 
were revealed: 60 were taking English, 44 were taking physics, 30 
were taking French, 15 were taking physics and French, 6 were 
taking both English and physics but not French, 24 were taking 
English and French, and 10 were taking all three subjects. 


(a) Show that 54 were enrolled in only one of the three subjects. 
(b) Show that 35 were enrolled in at least two of them. 


5. During quality control checking of a sample of 1000 TV sets, it 
was found that 100 sets had a defective picture tube, 75 sets hada 
defective sound system, 80 sets had a defective remote control 
system, 20 sets had a defective picture tube and a defective re- 
mote control, 30 sets had a defective picture tube and a defective 
sound system, 15 sets had a defective sound system and a defec- 
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tive remote control system, and 5 sets had all three defects. Use 
Venn diagrams to show that 

(a) 195 sets had at least one defect. 

(b) 805 sets had no defects. 

(c) 55 sets had a defective picture tube only. 

(d) 35 sets had a defective sound system only. 

(e) 50 sets had a defective remote control only. 


2 Relations 


Definition. Let a,b be elements ofa set S. Then the set {{a},(a,b)} is called 
an ordered pair and is denoted by (a,b); a is called the first component (or 
coordinate), and b is the second component (or coordinate). 


We now show that (a,b) = (c,d) if and only if a= cand b= d. 
If a=c and b=d, then trivially (a,b) = (c,d). Conversely, let 
(a,b) = (c,d). Then 


{(a),(a,5)} = {{c},(c,d)). 
By definition of equality of sets, this implies 
{a}=(c} or {a}= (c,d). 


If {a} = (c}, then we must have (a,b) = (c,d). This yields a = c,b = d. If, 
on the other hand, (a) = (c,d), then we must have (a,b) =(c).Soa=c=d 
and a = b =, which implies a = c = b= d. 


Definition. Let A,B be sets. The set of all ordered pairs (x,y), where x © A 
and y € B, is called the cartesian product of A and B, in that order, and is 
denoted by A X B. In symbols, 


AX B= {((x,y)|x EA, yE B). 


For example, if A=(1,2}) and B=({a,b,c) then AX B= 
{(1,@),(1,5),(1,c),(2,4),(2,5),(2,¢)}. 

The term “cartesian” is borrowed from coordinate geometry, where a 
point in the plane is represented by an ordered pair of real numbers (x,y) 
called its cartesian coordinates. The cartesian product R X R is then the 
set of cartesian coordinates of all points in the plane. 


Definition. Let A and B be sets, and let R bea subset of A X B. Then R is 
called a relation from A to B. If(x.y) € R, then x is said to be in relation R 
toy, written x R y. A relation from A to A is called a relation on A (or in A). 
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Strictly speaking, a relation is determined by three sets, A,B and a 
subset R of A X B, although we call it simply the relation R. If R isa 
relation from A to B, and S is a relation from C to D, then R and S are 
equal if A =C, B= D, and, forallxE A, yEB,xRyexSy. 


Definition. Let R be a relation in the set X. R is said to be 


(a) reflexive ifx R x forall x € X; 

(b) symmetric ifx R y implies y R x for all x,y € X; 

(c) antisymmetric ifx R yand y Rx imply x= y forall xy€X; 
(d) transitive ifx R yand y Rz imply x Rz forall x,y,z € X. 


If R is reflexive, symmetric, and transitive, then R is called an equivalence 
relation on X. If R is reflexive, antisymmetric, and transitive, then R is 
called a partial order on X. 


2.1 Examples 


(a) Let X be the set of all lines in a plane. For x,y € X let xlly mean that x 1s 
parallel to y. Let us further agree that every line is parallel to itself. Then tis 
an equivalence relation on_X. Similarly, congruence of triangles and simi- 
larity of triangles are equivalence relations. 

(b) Let X be a set whose elements are themselves sets. Consider the 
relation C determined by “‘set inclusion.” For any sets 4,B,C € X we see 
that 


(i) ACA; 
(i) ifA4C Band BCA, then A=B; 
(ii) if dC Band BCC, thenACC. 


Hence, set inclusion is reflexive, antisymmetric, and transitive; therefore 
it is a partial order on_X. 

(c) The relation = (‘“‘less than or equal to’’) on the set R of real numbers 
is reflexive, antisymmetric, and transitive; therefore, it is a partial order 
on R. 

(d) The relation congruence modulo n on Z is defined as follows. Let n 
be a fixed positive integer. For any x,y € Z, x is said to be congruent to y 
(modulo n), written 


x=y (mod n), 
if n divides x — y. Now, for any x,y,z in Z, it is true that 


(i) mn divides x — x = 0; hence, x = x (mod n); 
(11) if n divides x — y, then n divides y — x; 
(ii) if n divides x — y and also y — z, then n divides x — z. 
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This proves that congruence modulo 7 is an equivalence relation on Z. 


Let X be a set, and let <= be a partial order on XY. (The symbol = here 
denotes an arbitrary partial order and does not necessarily have its usual 
meaning of “less than or equal to” in real numbers.) The set X together 
with the partial order S is called a partially ordered set or, briefly, a poset. 
We refer to it as the poset (X,s) or simply the poset YX. 

Let (X,S) be a poset, and let x,y © X. If x s y, then x is said to be 
contained in y. If x s yand x ¥ y, then xis said to be properly contained in 
y, written x < y. If x < y and there is no element a in X such that 
x<a<~y, then y is said to cover x. 

A finite poset X can be represented by a diagram in the following 
manner. Represent each element in XY bya small circle (or a point) in such 
a way that whenever x < y, then yis higher than xin the diagram. Further, 
join x and y by a straight segment whenever y covers x. As an illustration, 
we give below the diagrams for the following three posets: 


(a) {1,2,3,4,5,6} ordered by the usual relation of “less than or equal 
to”: 

(b) (1,2,3,4,5,6} ordered by divisibility; 

(c) #({1,2,3)) ordered by set inclusion. 


(1, 2,3} 


6 
6 
4 
1,2 
(1, 2} (2.3) 
2 S 4 
(1) (3) 
Z 1 
4) 


(a) (b) (c) 


A 


> 


ed 


Let (X, <) bea partially ordered set. Let S be a subset of X. An upper 
bound of S is an element be X such that x < b for all xeS. A least upper 
bound (l.u.b.) of S is an element me X such that (i) x < m for all xeS and 
(11) if x < m’ for all xeS then m < m’. A lower bound and a greatest lower 
bound (g.|.b.) are defined analogously. 

It can be easily shown that a I.u.b. (g.1.b.) of S, if it exists, is unique. 
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Definition. A partially ordered set L is called a lattice if every pair of 
elements in L has a least upper bound and a greatest lower bound. 

The least upper bound and the greatest lower bound of {x, y} are 
written x v y and xa y, respectively (called join and meet of x, y, 
respectively). 

The diagrams (a) and (c) above represent lattices. 

A poset (X, <)1in which for all x, ye X, either x < y or y < x is called 
a chain or a totally ordered set. Every chain is a lattice. 

For any set S, the power set A(S), ordered by set inclusion, is a lattice. 
Here Av B=AUB and Aa B=AQZB for all A, BEA(S). 


Definition. Let E be an equivalence relation ona set X, and leta € X. The 
set of all elements in X that are in relation E toa is called the equivalence 
class of a under E and is denoted by E(a). That is, 


E(a) = (xE X|x Ea). 


A subset C of X is called an equivalence class of E (or an E-class) in X if 
C = E(a) for some a in X. The set of all equivalence classes of E in X is 
called the quotient set of X by E and is written X/E. That is, 


X/E =(CC X|C = E(a) for some ae X)}. 


For example, let X = Z, and let n be a fixed positive integer. Let 
x,y € Z. Suppose x = y means x = y (mod n); that is, n divides x — y. 
Then = is an equivalence relation on Z [Example 2.1(d)}. The set of 
equivalence classes Z/=, denoted by Z, or Z/(n), is the set 


(0,1,...,(” _ 1)}, 
where 


0 = (...,—2n,— n,0,n,2n,...}, 
L=(...,-2n + l—-at 1,lat 1,20 + 1,...}, 
2 = (...,-2n + 2,—n + 2,2,n + 2,2n + 2.,...), 


and so on. 

The equivalence classes [i.e., the elements of Z,, or Z/(n)] are called the 
residue classes of integers mod n. 

We observe, in this example, that integers a,b belong to the same 
equivalence class if and only if they differ by a multiple of n, and any two 
equivalence classes are either the same or disjoint. The following theorem 
shows that this property holds for every equivalence relation. 


2.2 Theorem. Let E be an equivalence relation ona set X. Then X/E is 
a partition of X. Conversely, given a partition n of X, there is a unique 
equivalence relation E such that X/E = 1. 
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Proof. For any a € X, a E a; hence, a € E(a). Suppose a € E(b). Then 
a E b; hence, for every x € F(a), x Ea,aE 6, and, therefore, x € E(b). 
Hence, E(a) C E(b). By symmetry, b E a; hence, E(b) C E(a). Therefore, 
E(a) = E(6). This proves that each a in_X belongs to exactly one equiva- 
lence class, namely, E(a). Hence, by Theorem 1.4, X/E isa partition of X. 

Conversely, let 2 be a given partition of X. Define the relation E as 
follows. For any x,y € X,x E yifand only if x,y are in the same block of z. 
Clearly, E is an equivalence relation on_X, and the equivalence classes of E 
are the blocks of x. Hence, X/E = x. Suppose E’ is also an equivalence 
relation on X such that X/E’ = x. Then for all x,y € X, 


x E y= x,y are in the same block of z 
=x E’ y. 


Hence, E = E’. Therefore E is the unique equivalence relation on X such 
that X/E=27. O 


Problems 


1. Let A = [0,1), B = (0,3,5). Draw the graph of A X B in the xy- 
plane. Is A X B= BX A? 

2. Ifeither A or B = ©, whatis A X B? If A X B = ©, what can you 
say about A and B? 

3. Suppose XC A and YC B. Show that X X YC A XB. If for 
given sets X, Y,A,B, we have X X YC A X B, does it necessarily 
follow that X C A and YC B? 

4. Let Ube the set of all people. Which of the following relations on 
U are equivalence relations? 

(a) isan uncle of 
(b) isa roommate of 
(c) isa friend of 

5. Describe the equivalence relation corresponding to the following 
partition of Z: 

(...,— 8,—4,0,4.8,...} U (...,—- 7,—- 3, 1.5...) U (...,-6,— 2,2,6,...} 
U (...,—5,— 1,3,7,...}. 


6. Show that the relation defined on N X N by 
(a,b)~(c,d) iff atd=b+c 
is an equivalence relation. 


7. Let Sbea nonempty set, and let R bea relation on S. Suppose the 
following properties hold: 


(i) Forevery xE S,x Rx. 
(ii) For every x,y,z € S,x R yand y R z imply z R x. 
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Prove that R 1s an equivalence relation. Also, prove that any 
equivalence relation satisfies (1) and (11). 

8. Determine the fallacy in the following argument, which attempts 
to show that a symmetric and transitive relation R on a set X 
implies that the relation is also reflexive: “By symmetry x R y 
implies y R x; by transitivity x R y and y Rx imply x R x.” 
[Hint: A relation R is a subset of X X X. If R= ©, then R is 
symmetric and transitive but not reflexive. ] 

9. Define a relation ~ in the set of integers Z as follows: a ~ b 
iff a + b is an even integer. Is ~ (a) reflexive, (b) symmetric, 
(c) transitive? If so, write down the quotient set Z/~. 


3 Mappings 


Definition. Let A and B be sets. A relation {from A to B is called a mapping 
(or a map or a function) from A to B if for each element x in A there is 
exactly one element y in B (called the image of x under f) such that x is in 
relation fto y. 


If fis a mapping from A to B, we write 
fA>B or ALB. 


Letf: A — B. Thesets A and Bare called, respectively, the domain and 
codomain of the mapping f Let x € A. If y is the image of x under /, we 
write {(x) = y and say that ftakes x to y; in symbols, 


xy 


or simply x > y. Again, if f(x) = y, then x is called a preimage of y. 

The graph of a mapping f: A — B is defined to be the set G = ((x,y) € 
A X B\ f(x) = y}. As a matter of fact, G is (the relation) / itself interpreted 
as a subset of A X B. 

Two mappings f: A — B and g: C— D are said to be equal, written 
f=e2,if A=C, B= D, and f(x) = g(x) for all x € A. 

We emphasize that if the mappings fand g have different codomains, 
they are not equal even if their domains are equal and f(x) = g(x) forevery 
x in the domain. 

A mapping f: X — X such that f(x) = x for all x © X is called the 
identity mapping on X and is denoted by i, (or /,). 

Let A bea nonempty subset ofa set XY. Then the mapping f: A — Xsuch 
that f(a) = a for each a € A is called an inclusion (or insertion) map of A 
into X. 
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Let fA — B. The subset of B consisting of every element that is the 
image of some element in 4 is called the image (or range) of the mapping 
fand is denoted by Im / That is, 


Im f= (y € Bly = f(x) for some x € A}. 


Let X and Sbe sets. The set of all possible mappings from X to Sis denoted 
by S*. That is, 


S*=(flf.X — S). 


Suppose X and S are finite sets having m and n elements, respectively. 
Consider any mapping f- X — S. The image of any given elementin X can 
be any one of the n elements in S. Therefore, the m elements in X can be 
assigned images in n Xn X +++ Xn=n"™ ways. This just means that 
there are exactly n” distinct mappings from X to S. Thus, for finite sets X 
and S, we have the result that |S*| = |.S]#!. 

For example, let X = (0,1). Then there are 2? = 4 distinct mappings 
from X to X: 


f'iX—7X  withOeo0, 10, 
f.X2X  withO 0, 161, 
fp.X 7X withOel, 10, 
fr.X 7X withOel, lod. 


We observe that if X is empty, there is just one mapping X — S, 
namely, the empty mapping in which there is no element to which an 
image is to be assigned. This might seem strange. but the definition of a 
mapping is satisfied vacuously. Note that this is true even if S is also 
empty. 

On the contrary, if. Sis empty but X is not empty, then there can be no 
mapping from X to S. 

Let f: A — Band g: B — C be mappings. Then the mapping 4: A ~ C 
given by 


h(x) = g(f(x)) for all x € A 


is called the composite of f followed by g and is denoted by g ° f(or more 
commonly by gf). Thus, g° f(x) = g(/(x)) for all x € A. 

The mappings fand g are called factors of the composite h = go f. 

Note that the notation for the composite mapping 1s such that the order 
in which the mappings act is from right to left. In g° f, facts first. 

The composite g ° fis defined whenever the domain of g contains the 
range of f. 

The composite of mappings f: A — B, g: B — C can be represented by 
the diagram 


\ 


One can go from A to C directly, 
A“>C 
or along the path 


AD BSC. 


If the result is the same - that is, the image of any x € A is the same along 
either path—then we say that the diagram commutes. It is clear that the 
diagram commutes if and only if h = gof. 

In the sequel we denote the composite by gf (instead of ge /). 


3.1 Theorem. Let f: A — B, g: B— C, andh: C— D. Then 
h(gf) = (hg)f. 


Proof. Clearly, h(gf) and (/ig)f have the same domain A and the same 


codomain D. 
Let x € A. Then, by definition of the composite, 


Wg f Xx) = A(gf(x)) = W(g(s(>))), 
and 
(hg) f(x) = he(f(x)) = A(g(f(>)). 
Hence, A(gf Xx) =(2gf)f(x0) for every xin A; therefore, h(gf) =(hg)f O 


Definition. A mapping f: A — B is 
(a) injective (or one-to-one or 1-1) if, for all x,,x.€ A, 
X, FX. => f(x) FS(X2) 


or, equivalently, f(x,) =f(x.) = xX, =X; 
(b) surjective (or onto) if, for every yE B, 


y = f(x) for some x E A. 


A mapping that is injective and surjective is said to be bijective. If f- 
A — Bisa bijective mapping, we may write f: A = B. A bijective mapping 
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f:- A—A is called a permutation of A. An injective (surjective, bijective) 
mapping is also called an injection (surjection, bijection). A bijective 
mapping is also called a one-to-one correspondence. 


The following theorem characterizes injective, surjective, and bijective 
mappings. 


3.2 Theorem. Let f: A — B. Then 


(i) fis injective if and only if no element in B has more than one 
preimage. 
(ii) fis surjective if and only if every element in B has a preimage. 
(iii) fis bijective if and only if every element in B has exactly one 
preimage. 


Proof. Obvious. O 


3.3 Theorem. An injective mapping from a finite set to itself is 
bijective. 


Proof. Let f: A — A be injective. Let a € A. We shall find b € A such that 

f(b) = a, which proves that f is surjective and, hence, biyective. Now 

consider the effect of performing f repeatedly. Let us write f 2 for ff 
and, in general, ff---f (with n factors) as f". The elements a= 

f(a), f(a), f(a),... are all in a finite set A; hence, there must be repetitions. 
So assume that f(a) = f(a), where r > s, say. If s = 0 then choose 6 = 

f(a). Otherwise, because fis injective, 


fa) = f(a) => ff" (a) = ff" (a) = f(a) 
= fsa) => => f(a) =a=> f(b) =a, 


where b = f'~*—"(a). O 
We wish to remark that Theorem 3.3 does not hold for infinite sets. For 
example, let i N — N, defined by f(x) = x + 1, bea mapping from the set 


of natural numbers to itself. Then fis injective but not surjective. Why? 
Recall that i, denotes the identity mapping on X. 


Definition. Let fA — B. A mapping g: B — A is called an inverse of f if 
fe=ig and gfriy. 
If f has an inverse, we say f is invertible. 


If fis invertible, then its inverse, written f—', is unique. 
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3.4 Theorem. fis invertible if and only if f is bijective. 


Proof. Let f: A — B be invertible with g as its inverse. Then f(x) =f()’) = 
gof(x) = ef(y) = x=y. Therefore, f is 1-1. Further, if z€ B, then 
fg(z) = z, which implies that g(z) € A is a preimage of z under f. This 
proves fis onto. Hence, fis a bijection. 

Conversely, let f; A — B bea bijection. Define g: B — A as follows: Let 
b € Band define g(b) = a, where ais the unique preimage of b € Bunder 
f. Obviously, g is the inverse of f/ O 


Definition. Let f: A — B. A mapping g: B— A is 


(a) a left inverse of fifgf=i,, 
(b) aright inverse of f if fg = ig. 


Clearly, the inverse of f, if it exists, is a left inverse as well as a night 
inverse of 


3.5 Theorem. A mapping f: A — B is 


(i) injective ifand only if f has a left inverse, 
(ti) surjective if and only iff has a right inverse. 


Proof. Exercise. O 


We shall frequently come across mappings that are determined in a 
natural way - induced, as we say - by another mapping or by some 
inherent property of the domain or codomain. In the next section we 
describe some of these induced mappings. 


3.6 Examples of induced mappings 


(a) Let f. A — B. Let SC A and TC B such that f(x) € T for all x E S. 
Then finduces the mapping g: S — T given by g(x) = f(x) for all x E S. 
The mapping g is called a restriction of f. 
(b) There is an important mapping determined by a subset of a set. Let 
SC Xand let A = (0,1). Then S determines the mapping /;: ¥ — A given 
by 


Sx) = 1 ifxe S, 
=0 ifx€é S. 


The mapping f, is called the characteristic function of S (as a subset of X). 
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Conversely, given a mapping f: X — A, let 
S=(x€ XUf(o = I). 


Then it is clear that the characteristic function of S is the given mapping 
J. Moreover, Sis the unique subset of X with fas its characteristic function. 
This proves that the mapping yw: P(X) — A“, given by S > fe, is bijective. 

It is clear that we could have used the set 2 = (1,2) in place of A. Thus, 
there is a one-to-one correspondence between the set P(X) and 2”. Be- 
cause of this, the set P(X) is also written as 2°. 

(c) Let E be an equivalence relation on the set XY. Then E induces the 
surjective mapping p: X — X/E, with x > E(x), where E(x) is the equiva- 
lence class of x under E. The mapping pis called the canonical (or natural) 
mapping from X to the quotient set X/E. 

(d) Given sets S and 7, there are two natural mappings with domain 
S X T-namely, 


p:.SXT-S with (x,y)'> x forall (x,y) E SX T, 
and 
q:.SXT—T ~ with (xy) y for all (x,y) © S X T. 


The mappings p and gare called the projections from S X T onto Sand T, 
respectively. 
(e) Given mappings f: A — S and g: B— T, let us define 


@:AXB—-SXT ~~ with (x,») > (1(-),2(¥)) 
for all (x,y) E A X B. 


The mapping @ is called the cartesian product of fand g and is denoted by 
IX g or (fg). 
(f) Let f; A — B. Then f induces a mapping 


Sa: P(A) — A(B) 
given by 
SAS) = (y © Bly = f(x) for some x € S) 


for every SC A. Note, in particular, that f,(A) = Im f, f«(@) = ©, and 
for every singleton {a} C A, f4({a}) = (f(a)}. It is usual to drop the sub- 
script * and to denote the induced mapping by falso. That 1s, 


S(S) = {y € Bly = f(x) for some x € S). 


The context usually makes clear whether fdenotes the mapping A — Bor 
the induced mapping ¥(A) — ¥(B). The mapping f: A — B also induces 
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a mapping in the reverse direction, namely, 
S*: P(B) — P(A) 
given by 
S(T) ={xE Alf(x) ET) for every 7C B. 


Once again, the induced mapping f* is commonly written f—', so that 
S-\(T) = {x € A| f(x) € T). But it should be carefully noted that f~', used 
in this sense, is not the inverse of fas defined earlier. In fact, the induced 
mapping f*: P(B) — P(A) always exists whether fis invertible or not. 


Let S be a nonempty set and a positive integer. As usual, let n denote 
the set (1,...,2}. A mapping f: n — S is called a /ist (or an ordered set) of 
elements of S. The element /(i) is usually written f,, and the list is 
exhibited as (/,./5,...,f,,). The positive integer 7 is called the length of the 
list, and /,,..., f, are called members or elements of the list. A list of length n 
is also called an n-tuple. 

A mapping f: N — Sis called a sequence of elements of S and is written 


Uf, Jim 1,2... or Sf, SA jude where /; = f(i), aS before. 


More generally, a mapping f: X — Sis called a family of elements of S 
and represented as (/; );- y. The set X is called the index set of the family /, 
and fis said to be indexed by X or X-indexed. 

One may wonder why we introduced the new term “family,” since, in 
general, there seems to be no difference between a family and a mapping. 
The difference lies in the point of view. When we refer to the mapping /: 
X — S§ as a family, we think of the elements of X as labels for locating 
elements of S. One may visualize a set of pigeonholes or post office boxes, 
each labeled with exactly one element of X. In each box we put exactly one 
element of S (with repetitions allowed; that is, we may put the same 
element in several boxes), /, being the element in the box labeled /. Thus, 
we see that the elements of the index set X serve as addresses for the 
locations of the elements of S in the range of 

Let (A,);-x be an arbitrary family of sets indexed by X. The cartesian 
product of (A,),- x, denoted by I,_yA,;, 1s defined as 


I] A,= {f x— 'S) A, 
tex ‘eX 
Note that each element of [,<yA, is a family (a,),-¥, where a, € A,. 

In the particular case when A, = A for each i €_X, the cartesian product 
Nex; is in fact A*. ; 

The cartesian product of a finite family (that is, a list) of sets 
A,, Apd,...,A, is written I7_,A,; or A, X A, X *** XA, 


S(t) € A, for each 1 € x}. 
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Problems 


4 


—, 
° 


aie ass 


Prove Theorem 3.2. 

Let f: A — Band g: B — C be mappings. Show that 

(a) gfis injective if both fand g are injective. 

(b) gfis surjective if both fand g are surjective. 

(c) gfis bijective if both fand g are bijective. 

If fg is injective (or surjective), what can be said about fand g? 
Prove Theorem 3.5. 

Let f: A — Band g: B — A be mappings such that gf= /,. If fis 
surjective or g is injective, show that fand g are bijective. Then 
show that fg = /,. 

Let C[0,1} be the set of all continuous functions from the closed 
interval (0,1) to R, and let S be the set of all differentiable func- 
tions fE C[0,1} such that /(0) = 0 and /’ is continuous. Show 
that the mapping D: f> /’ is a bijection from C [0,1] to S. 
Find the number of injective mappings from a set with 772 ele- 
ments to a set with 7 elements if (a) ™ = 1, (b)m <n,(c)m> n. 
If fis a mapping from N to N defined by f(x) = 2x + 1, show that 
J has a left inverse but no nght inverse. Also show that f has 
infinitely many left inverses. 

Let S be a set with m elements. Find the number of distinct 
mappings from S to S. How many are bijective? 


Binary operations 


Definition. A mapping 


* SxXS—>S 


is called a binary operation on the set S. 


A binary operation on S thus assigns to each ordered pair of elements of 


S exactly one element of S. Binary operations are usually represented by 
symbols like *,- ,+,°, instead of letters fg, and so on. Moreover, the image 
of (x,y) under a binary operation * is written x * y instead of *(.v,y). 


Addition (+) and multiplication (-) in the set Z of integers or, more 


generally, in the set R of real numbers are the most familiar examples of 
binary operations. We have come across a few others in this chapter, 
without having called them by that name. If 4 and Bare subsets of X, then 
AU B,AN B,and A — Barealsosubsets of Y. Hence, union, intersection, 
and difference are all binary operations on the set P(X). Again, given 


22 Sets and mappings 


mappings f: X — X and g: X — X, their composite gf is also a mapping 
X — X. Hence, the composition (°) of mappings is a binary operation on 
the set S = X* of all mappings from X to X. 

More generally, for any positive integer n, a mapping f: S” — S, where 
S"= SX SX ++: & S§(n factors), is called an n-ary operationon S. When 
n= |, we get a unary operation, u: S— S. 


Definition. A binary operation *: SX S—S on the set § is 
(a) commutative if 
x*y=yrx  forallxyeES, 
(b) associative if 
x*(y*zy=(x*y)*z forallxy,zeES. 
Ife is another binary operation on S, then * is 
(c) left-distributive over ° if 
x*(yez)=(x* y)e(x*z) ~~ forall x,y,zES, 
(d) right-distributive over if 
(yoz)*x=(y*x)e(z*x) = forall x,y,zES. 


If * is both left- and right-distributive over °, then * is said to be 
distributive over ». 


4.1 Examples 


(a) Addition (+) and multiplication (-) in Z (or, more generally, in R) are 
both commuiative and associative. Moreover, - is distributive over + (but 
the converse is not true). 

(b) Union and intersection in the set P(X) are both commutative and 
associative. Moreover, each is distributive over the other. 

(c) Composition (°) of mappings is an associative binary operation on 
the set X~* of all mappings from _X to_X. If X has more than one element, 
then © is not commutative. 

Let * be an associative binary operation on the set S. Then 
a*(b*c)=(a* b) *c forall a,b,c € S. Hence, we may omit the paren- 
theses and simply write a * 6 * c without ambiguity. One would intui- 
tively expect that the result can be generalized to any finite number of 
elements. This is indeed true. To prove it formally, we first introduce 
some notation. 

Let * be a binary operation (not necessarily associative) on S. Let us call 
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a* b the product of a,b in that order. Given a list a,,...,a, € S, we can 
form a product of the elements a,,...,a,, in that order, in several ways by 
inserting various parentheses and repeatedly applying the binary opera- 
tion *. But the parentheses must be suitably placed so as to get a meaning- 
ful product, that is, one without ambiguity. For example, if * is not 
associative, the expression a * b * c is ambiguous and, hence, not a 
meaningful product. The meaningful products of a,b,c in that order are 
(a*b)*c and a*(b*c). For four elements, a,b,c,d, there are five 
meaningful products: ((a * b) * c) * d, (a * (b* c)) * d, (a * b) *(c* d), 
a*((b*c)*d),a*(b*(c*d)). 

In general, let #(a, ,...,a,) denote any meaningful product of a, ,...,a, in 
that order. Then, clearly, A(a,,...,a,) = &,(@,,...,€,) * Po(Ay415-+4,) for 
some k, where | = k Sn and @,(a,,...,a,) and @,(a,4,,-..,€,) are them- 
selves meaningful products. 

The standard product of elements a,,...,a, in that order is defined 
inductively (see Theorem 1.7, Chapter 2) as follows: 


n n—l 
Ma=(T1 2) +2, form > 1. 


im] 


For example, the standard product of a, ,a,,a;,a,,a; in that order is 
(((a, * a>) a;) * a,) * as. 


4.2 Theorem. Let * be an associative binary operation on the set S. 
Given a list a,,....a, € S, every meaningful product ofa, ,...,a, in that order 
is equal to the standard product MV}. ,a,. 


Proof. We prove the theorem by induction on n. For n = | the result is 
trivially true. Let n > | and suppose the result holds for products of m 
elements for every m <n. Let @(a,,....a,) be any meaningful product of 
a,,...,a, in that order. Then A(a,,...,a,) = @,(Q, ...,4,) * B2(y4) 5---54,) for 
some k, where | Sk Sn and ¢,(a),...,a,) and $,(a,4,,...,a,) are mean- 
ingful products of a,,...,a, and a,,;,...,€2,, respectively. By the induction 
hypothesis, we have 


k 
d,(a, y-+-9) 7 Il a;, 
f=] 


n—k 
PY Ag4 1 9--+1n) = Il Ans): 
j=l 
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Hence, 


P(A, ,.--52n) = (I a) ‘(i ta) 


(= 


Iifk=n-—1, then 


da, 5++43Qn) os 


Now 


(i) + (i) 


is a meaningful product of n — | elements; therefore, by the induction 
hypothesis, it is equal to M7>a,. Therefore, once again 


(a, ,...,4,) = (TT a) *a,= Il a; O 


i=] im) 


Theorem 4.2 is known as the generalized associative law. Asa result of 
the theorem, if * is an associative binary operation on S, then given a list 
a,,...,a, & S there is a unique product of a,,...,a, in that order that is 
written a, * «-*: * a, (without any parentheses). 


Definition. Let * be an associative binary operation ona set S. Letae S 
andn€N. We define inductively the power of an element a as follows: 


a'=a, a"*t!=(q") * aq. 


The validity of this definition follows from the Recursion Theorem 
(Theorem 1.7, Chapter 2). 


43 Theorem. Let * be an associative binary operation on S. Let 
a€ S, and let m,n be positive integers. Then 
(i) a™*ar=antn 


(ii) (a"yv=a™. 


Cardinality of a set 25 


Proof. Follows from the definition of a” and generalized associative 
law. O 


Problems 


1. Determine if the binary operation ° on R is associative (commu- 
tative) in each of the following cases: 
(a) xey=x*y 
(b) xe y= min(x,y) 

(c) xo y=x" + y* 
(d) xey=1 

2. Let S be a nonempty set with associative binary operation o. Let 
x,y,z © S. Suppose x commutes with y and z. Show that x com- 
mutes with yo z also. 

3. Let Sbe a nonempty set with associative binary operation e. Let 
a € S. Show that the binary operation © given by x° y= xeacy 
is also associative. If ° is commutative, is o commutative? 

4. Let Sbea set with elements. Find the number of binary opera- 
tions on S. 

5. How many binary operations in Problem 4 are commutative? 

6. Suppose S is a nonempty set with associative binary operation e. 
Suppose e, fE S such that x° e = x and fe x = x for all x ES. 
Prove that e=/. Furthermore, if x° y= e= ze x, show that 
y=z. 


§ Cardinality of a set 


Earlier in this chapter when we defined a finite set as having a finite 
number of elements, we relied on our intuitive notion of counting. We 
shall now make this notion precise and extend it to arbitrary sets. 

Let A,B be sets. If there exists a bijective mapping f: A — B, we say that 
A is equipotent to B and write A = B. It is clear that equipotence is an 
equivalence relation (on any given collection of sets). This enables us to 


assign to every set Sa cardinal number|S|in accordance with the following 
definition. 


Definition. Let A and B be sets. If there exists a bijection f: A — B, then A 
and B have the same cardinal number (or cardinality) |A| =|B|. 


If mand nare natural numbers, it can be easily proved that the sets m = 
(1,....m) and n = (1,...,2) are equipotent if and only if m = n. We use this 
fact to define a finite set and its cardinality. 
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Definition. Let S be a set. If S is equipotent to the set {\,...,n} for some 
n EN, then S is called a finite set of cardinal number n. An empty Set is a 
finite set of cardinal number 0. 


A set that is not finite is called an infinite set. The cardinal number of an 
infinite set is called a transfinite cardinal number. An infinite set S is said 
to be countable (or denumerable) if S is equipotent to the set N. The 
cardinal number of N (and hence of every countable set) is denoted by Xp. 
(Some authors call a set countable if it is finite or equipotent to N. The 
letter &, pronounced aleph, is the first letter of the Hebrew alphabet.) 

We shall prove that Z and Q are both countable sets; hence, |Z| = |{Q| = 
X,. But the set R of real numbers is uncountable. We shall prove that, ina 
sense to be explained later, the cardinal number c of R 1s strictly greater 
than Xo. 

The usual ordering of natural numbers gives an ordering of the cardinal 
numbers of finite sets. We extend this ordering to arbitrary (finite or 
infinite) sets. 


Definition. Let A and B be sets. If there exists an injective mapping f- 
A — B, then|A| is said to be less than or equal to |B| (written |A| Ss |B). 


For cardinal numbers of finite sets, the relation s coincides with the 
usual ordering of natural numbers. Moreover, it is clear that for cardinal 
numbers of arbitrary sets the relation s is reflexive and transitive. For any 
set A the identity map A — A is injective; hence, || s|A|. If mappings /: 
A — Band g: B— C are both injective, then gf: A — C is also injective. 
Hence, |A| = |B}, |B| =|C| imply |A| s|C). The following theorem shows 
that s is also antisymmetric and, therefore, a partial order on the set of 
cardinal numbers. 

For any mapping @: X — X, let", n € N, have the usual meaning [i.e., 
d"=dede ::> od (n times)}, and let @° denote the identity mapping. 


5.1 Theorem (Shréder-Bernstein). Let A and B be sets, and let f- 
A — Band g: B —A be injective mappings. Then there exists a bijection y: 
A —B. 


Proof. If fis not surjective, let C = B —Im/ 
Set 


B* = ((fey(c)lcE C, n= 0) 
and 
A* = 9(B*) = (g(b)|b € B®). 


Cardinality of a set 27 


The restriction of g to B* is obviously a bijection B* — A*. Therefore, 
every ain A* has a unique preimage g~ '(a) in B*. Consider the mapping 
yw: A — B given by 


wa) = g~ (a) if ae A*, 
= f(a) if a ¢& A*. 


We claim that wis bijective. We first prove that a € A* ifand only if f(a) € 
B*, Leta € A. Ifa € A*, then a = g(b) forsome b =( fg)"(c), where c € C, 
n= 0. Hence, f(a) = fe(b) = (fg)"* (c) € B*. Conversely, if f(a) € B*, 
then f(a) = (fg)"(c) for some c € C and n > 0. Hence, a = g( fg)" ‘(c) = 
g(b), where b = (fg)"~(c) € B*. Therefore, a € A*. 

Let a,a’ € A and y(a) = y(a’). If y(a) = y(a’) € B*, then a,a’ € A*; 
hence, g~'(a) = g~'(a’), which yields a =a’. If y(a) ¢ B*, then a,a’ ¢ A*. 
Thus, 


y(a) = y(a’) = f(a) = f(a’) > a= a’. 


This proves y is injective. To prove y is surjective, let b € B. 

Ifb € B*, then a = g(b) € A*; hence, y(a) = g~'(a) = b.1fb € B*, then 
b ¢ C because C C B*. Hence, b = f(a) for some a € A. Since f(a) ¢ B*, 
a ¢ A*; hence, y(a) = f(a) = b. Therefore, y is surjective and, hence, 
bijective. O 


We saw earlier that if X is a finite set with 1 elements, then P(X) has 2” 
elements (Theorem 1.3). Hence, |X|< |P(X)|. The following theorem 
shows that this result holds for any set. (As usual, |.A| < |B|means that|A|s 
|B\ and | A] #|B].) 


§.2 Theorem. For any set X, |X| <|PA(X)|. 


Proof. The mapping j: X — P(X) given by j(x) = (x) for all xE X 1S 
obviously injective. Hence, |X| = |P(X)|. To prove that |X| #|A(X)|, we 
have to show that there is no surjective (and hence no bijective) mapping 
from X to P(X). Since we have already proved {Example 3.6(b)] that 
P(X) is equipotent to (0,1)}*, it will suffice to prove that there is no 
surjective mapping from_X to (0,1)}*. 

Consider any mapping f: X — (0,1)*. For any x in X, let f, denote the 
image of x under the mapping f. Then f,, being an element of (0,1)*, is a 
mapping f,: X — {0,1}. Consider now the mapping g: X — {0,1} given by 


g(x’) =0 ffx) = 1, 
= | if f(x’) = 90, 


for all x’ € X. Thus, g € {0,1)*, but g ¢ Im f Hence, the mapping f: X > 
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(0,1}* is not surjective. This proves that |X| # |P(X)|. Therefore |X| < 
|A(X)|. 0 


For any set X let us denote the cardinal number of P(X) by 2'*!. This 
notation is in agreement with the equality |P(X)|= 2'*! for finite sets 
(Theorem 1.3). 

We shall now prove the important result that the set of real numbers is 
not countable. 


§.3 Theorem. The cardinal number of the set R of real numbers is 
¢ = 2Xo, 


Hence, R is not countable. 


Proof. Any mapping f: N — (0,1) determines an infinite decimal 


x, = O° f(D SSB). 


Then f+ x;is a mapping ¢: (0,1)% — R, which is clearly injective. On the 
other hand, any real number x can be expressed in the binary scale in the 
form 


Xe XX 4X XQX4XGQer', 
where x, = 0 or | for every n € N. Hence, x determines a mapping 
g,: N — (0,1) with n>xX,. 
Thus, we have an injective mapping 
yw; R—(0,1)8  givenby x+>g,. 
Therefore, by Theorem 5.1, R is equipotent to (0,1), hence, 
¢ =|R|=[(0,1)8] = 2%. 
By Theorem 5.2, |N| = X) < 2%, so there is no bijection from N to R. 


Therefore, R is uncountable. © 


We next prove some properties of countable sets. It follows 1mmedi- 
ately from the definition that a set S is countable if and only if S= 
(S, ,S>,.-.}, where the S; are all distinct. If S =(S, ,S5,...}, but the S; are not 
all distinct, we can delete all repetitions in the sequence (S, ,S3,...} and be 
left with either a sequence or a finite list. Hence, S is either countable or 
finite. Put differently, if there is a surjective mapping N — S, then S 1s 
either countable or finite. 
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5.4 Theorem. 4 subset of a countable set is countable or finite. 


Proof. Let S be a countable set, and let 7c S. If T is empty, there is 
nothing to prove. Suppose 7 is nonempty and let a € 7. Because S is 
countable, there is a byection /: N — S that induces a surjective mapping 
g: N — T given by 


a(ny=f(n) if f(NhE T, 
=q if f(n) € T. 


Hence, 7 is either countable or finite. O 


5.5 Theorem. The union of a sequence of finite sets is countable or 
finite. 


Proof. Let {A,B,...) be a sequence of finite sets, and let Sbe their union. Let 
A = (4,,@,...,a4)}, B = (b, ,b3,...,6,}, and so on. Then 
S= {41,@2,...,€,,b,,b2,...,b,,...}. 


Thus, there is a surjective mapping N — S, so Sis countable or finite. O 


5.6 Theorem. Let A and B be countable sets. Then AU Band AX B 
are countable. More generally, the union and cartesian product of a finite 
number of countable sets are countable. 


Proof. Let A = (a,,Q,...}, B = (5, ,bp,...}. Then A U B = {a,,b, ,a,,b),...)}. 
Because A C A U B, A U Bis not finite. Hence, A U B is countable. 
The set A X Bcan be partitioned into finite subsets 


S,=((a,b)litj=pt0),  p=1,2,... 


Hence, by Theorem 5.5, A X B is countable. 
The more general result fo:lows by induction on the number ofsets. O 


§.7 Theorem. Z and Q are countable sets. 

Proof. Z= NUN’, where N’ = (0,— 1,—2,...). Hence, by the previous 

theorem, Z is countable. Therefore, Z < N is countable. The mapping 
d:ZXN—>Q7 given by d(a,b) = a/b 


is surjective. Hence, Qis countable. O 


CHAPTER 2 


Integers, real numbers, and 
complex numbers 


1] Integers 


In this section we give some properties of integers that the reader will 
encounter throughout the book. We do not intend to give here an axio- 
matic development of this topic (we do this in Chapter 13); rather we 
assume our familiarity with addition, multiplication of integers, and the 
usual properties of these operations. 

We first give some definitions and notation. An integer 5 is a factor or 
divisor of an integer a (or a is a multiple of b) if there exists an integer c 
such that a = bc. We say that b divides a and write bla (bYa means 5 does 
not divide a). A nonzero integer pis called a prime if p # + | and the only 
factors of p are +p and + |. Let a be an integer. The absolute value (or 
modulus) ja] of a is defined as follows: 


a ifa> 0, 
la|= 0 if a = 0, 
—a ifa<0O. 


The set of integers is denoted by Z, and the set of positive integers (also 
called natural numbers) by N. 

We now state the well-ordering property of positive integers, which 1s 
proved in Chapter 13. 

If S is a nonempty subset of N, then S contains a smallest element 
(also called \east element). 

This property along with our assumed familiarity with addition and 
multiplication of integers provides a basis for proving a number of other 
important properties of Z. 
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We now derive some consequences of the well-ordering property of 
positive integers. To begin, we derive 


1.1 First principle of induction 
Let S be a subset of N satisfying the following properties: 


i) 1ES. 
(ii) IfaES,thenatl1eEs. 


ThenS=N. 


Proof. Let T = (x € N\x € S}. If T = ©, we are done. So let T # ©. Then 
by the well-ordering property of positive integers, 7 has a smallest element 
m. Clearly, m # 1 because | € S. Then, by (ii), m — 1 € S, which contra- 
dicts the minimality of m. Thus, T = ©, and, hence, S=N. OQ 


Next we derive 


1.2 Second principle of induction 
Let S be a subset of N such that 


(i) 1E€S, and 
(ii) nE&S whenever me S for all positive integers m <n. 


Then S=N. 


Proof. Let T = {x E€ N\|x € S}. If T = ©, we are done. Otherwise by the 
well-ordering property 7 contains a smallest element m. By (1), m # I. 
Then for all positive integers x < m, x € S. But then by (11), mE S, a 
contradiction. Therefore 7=0. O 


One of the important results that follow from the second principle of 
induction is 


1.3 Theorem (fundamental theorem of arithmetic). Every positive 
integer is either | or it can be written in one and only one way as a product 
of positive primes. 


Proof. The theorem is true ifm = 1.Soletn > 1. If nis prime, we are done. 
Else, let nm =7n,n,, where 7,,n2> 1, so n,, n2<n. Assume that the 
theorem is true for all positive integers <n. We shall prove that it is true 
also for n. Now, by the induction hypothesis each n, ,n, can be written 
uniquely as a product of positive primes. Thus, 7 can be written as a 
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product of positive primes. We proceed to prove that the factorization so 
obtained is unique. Let 


n =P, Pr°°*’ Ds = 4:92°°° q> 
where p, and q; are positive primes. If p, = q,, then 


= py P= USN. 

P; 
But, by the induction hypothesis, this implies s— 1 =¢-— 1, and by 
renumbering the subscripts (if necessary) p, = q;, i = 2,...,5. This proves 
the uniqueness of prime factorization if p, = q, (or if some p; = q,). So let 
p, * q,;. For definiteness, let p, > g,. Then 


n> (DP, — 9)P2°** Ps = Di Pr *** Ds — GW Pr *** Ds 
= 9\42°** Gy — M1 D2°** Ds = (92 °** Gs — D2 *** Ds): 


By the induction hypothesis either g,=p, for some [= 2,...,5 or 
g,\(P, — 4). If ail(p, — 4,), then q, = p,, a contradiction. Thus, q, = p; 
for some 2 = i = s, which reduces to the case considered in the beginning. 
The proof of the theorem then follows by the second principle of induc- 
tion. O 


Next we prove 


1.4 Theorem (division algorithm). Let a,b € Z, b>0. Then there 
exist unique integers q and r such that 


a=bqtr, O<r<b, 


where q is called the quotient, and r the remainder of a modulo b. 


Proof. Let S = (a — bx|x € Z). S has a positive element a — b(—|a|). By 
the well-ordering property of positive integers, S contains a smallest posi- 
tive integer, say, m= a— bx. Ifm> b, then m— b=a— b(x+ IDES, 
which contradicts the minimality of m. Thus, 1 s b. 

If m = b, then a — bx = bimplies a = b(x + 1); therefore, choose q = 
x + 1 and r= 0. If m < 5, then choose g = x and r = m. For uniqueness 
of qg and r, let 


a=bqt+r=bq' +r’. 
Then 
b(q—q')=(r' —r). 


Because |r—r’|< b, g—q’ =0. Thus, g=q’, which gives, in turn, 
r=r’, O 
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Definition. Let a,b € Z. Then d is called a greatest common divisor 
(g.c.d.) of a and b if the following hold: 


(i) dlaand d\b. 
(ii) Ifce€ Z, cla, and c\b, then c\d. 


Remark. dis unique to within the factors + |. The positive g.c.d. of a and 
b is usually denoted by (a,b). 


1.5 Theorem. Let a and b be two nonzero integers. Then 


(i) a,b havea positive g.c.d., sayd: 
(ii) there exist integers m,n such that d = am + bn. 


Proof. Let S = {ax + by|x,y € Z). Then obviously S contains positive 
integers. By the well-ordering property, let d = am + bn be the smallest 
positive integer in S. We prove that d is the g.c.d. of a and 5. 

By the division algorithm a = dg + r, where 0 = r < d. Then 


r=a—dq=a-—(am + bn)q = a(1 — mq) — bngeS, 


which contradicts the minimality of din S unless r = 0. Hence, r = 0 and 
d\a. Similarly, d|b. Further, let c € Z, cla, and clb. Then d= am + bn 
implies cid. Hence, d= (a,b). O 


1.6 Corollary. Let p be prime and a,b € Z such that p\ab. Then p\a 
or p\b. 


Proof. Suppose pfa. Then (p,a) = |. This implies that there exist u,v € Z 
such that up + va = |. Then upb + vab = b; so p{b since pjab. O 


Sometimes a mapping f: N — S is described inductively; that is, by 
giving /(1) and prescribing a rule that determines f(” + 1) when f{(n) is 
known. For example, the standard product I17_, a, was defined in this 
manner in the last chapter (Section 4). Another example is the definition 
of the factorial function: 1!=1, (n+ 1)!=—(n+1)-n! for all ne 
N. Now the question 1s whether this is a valid procedure for describing a 
mapping. In other words. does there exist a unique mapping 
Sf: N — S satisfying the conditions stated earlier? Intuitively, the answer 
seems to be yes. We prove it formally in the following theorem. 


1.7 Theorem (recursion theorem). Let S be a set anda ES. Givena 
mapping: N X S — S, there exists a unique mapping f N — S such that 


S(\=a and f(n+ 1) = dn, f(n)) for alln EN. 
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Proof. We first show that if a mapping satisfying the given conditions 
exists, then it is unique. Let fg: N— S be two such mappings. Then 
J(1) = a= g(1). Further, if f(n) = g(n) for any n EN, then f(n + 1) = 
d(n, f(n)) = &(n,g(n)) = g(n + 1). Hence, by the induction principle, 
J(n) = g(n) for all n EN. 

To prove the existence of f, let 4 be the set of all subsets X of N X S 
satisfying the following conditions: 


(1,a) € X; and if (n, x) € X, then (n + 1, d(n,x))E X forallnEe N. (*) 


Clearly, the set N X< S itself satisfies (*); hence, Af is nonempty. Let R = 
Nyea {X } be the intersection of all sets in 44. Then R satisfies (*); hence, 
Re M. Let P be the set of positive integers n for which there exists a 
unique element x, in S such that (n, x,) € R. If | € P, there exists b € S, 
b # a, such that (1,5) € R. But then R — {(1,5))} satisfies (*); hence, R C 
R — {(1,5)}, a contradiction. Therefore, | € P. 

Now let n € N and suppose 7 € P, but n+ 1 ¢ P. Then there is a 
unique x,€S such that (n,x,)€R; hence, by condition (*), (7+ 
| ,d(n,x,)) & R. Since n+ 1 ¢ P, there exists cE S, c# d(n,x,), such 
that (n + I.c) € R. Again, it is easily verified that R — {(n + 1I,c)} satisfies 
(*); hence R C R — {(n + 1,c)}, a contradiction. Therefore, if € P, then 
n+1€ P. Therefore, P = N. So, for every n € N there is a unique ele- 
ment x, € S such that (7, x,) € R. 

Now (,x,) € R=(n + 1,d(n,x,)) € R; hence, by uniqueness of 
Xn+1, O(N, X,) = X,4,. Therefore the mapping f/f: N — S given by f(n) = 
xX, Satishes the required conditions, namely, f(1) =a and f(n + 1) = 
dn, f(n)) for allnEN. O 


Problems 
1. Prove by induction the following summation formulas: 
(4n? — | 
(a) 127+3?7+:-:-+(2n— 1) oo 


2 
(b) Pa Deg m= | MED! 


2. Prove by induction that 


d" ee l 
rae as ale erates 

3. Let A,,...,4, be subsets of a set. Prove by induction the following 
laws: 


(a) (A 4) = U A. 


2 
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(b) (0 4) = (") Aj. 

i=l =| 
If a,b,c E Z such that ajb and alc, show that almb + nc, for 
mneé& Z. 
Let a,b € Z. The least common multiple (l.c.m.) of a and 5, 
denoted by [a, 5], is defined to be a positive integer d such that 
(i) ald and bid, 
(ii) whenever a|x and b|x, then d|x. 
Suppose a = pf +: pftandb= p# «++ p*be factorizations of a 
and b as a product of primes, where e,, f, = 0. Prove that 
(a) [a,b] is unique. 
(b) [a,b] = pf -*: pf, where g, = max(e,, f,). 
(c) [a,b] = ab/(a,b), where (a,b) is the greatest common divisor 

of a and b. 

show that any integer 7 > | is either a prime or has a prime factor 
SV”. 
Show that the polynomial 


XOXO oe ey Xk Pa, 


where a, € Z, has no rational root that is not integral. Also show 
that any integral root must be a divisor of a,. 

Prove Wilson’s theorem: For any prime p, (p— 1)!+1 20 
(mod p). 

Prove Fermat’s theorem: For any a, p € Z with p prime, a? = a 
(mod p). 

Let 1 be a positive integer, and let d(m) denote the number of 
positive integers less than and prime to 7. Prove Euler’s theorem: 
For any integer a relatively prime to n, a” = 1 (mod n). (¢ is 
called Euler’s function). Furthermore, note that for a prime 
p,@(p)=p—1. Thus, Euler’s theorem generalizes Fermat's 
theorem. 


Rational, real, and complex numbers 


We assume that the reader is familiar with rational and real numbers and 
the usual operations of addition and multiplication in them. Constructing 
the rational numbers from integers is given in Chapter 12. However, the 
construction of real numbers is not given because it is normally done in 
books on analysis (see, for example, Landau 1960). 

A real number may be taken to be a (terminating or nonterminating) 
decimal. A terminating or a recurring decimal is a rational number that is, 
equivalently, a fraction of the form a/b, where a,b € Z and b # 0. 

The system of complex numbers is defined to be the set C= RXR 
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(where R is the set of real numbers) with addition and multiplication given 
by 


(a,b) + (c,d) =(a+c,b+d), 
(a,b) : (c,d) = (ac — bd, ad + bc). 


It is easily verified that addition and multiplication in C are both 
associative and commutative and, moreover, that multiplication is 
distributive over addition. For any (a,b) € C, 


(a,b) + (0,0) = (a,b) = (a,b) 1,0), 
(a,b) + (— a,— b) = (0,0). 


Furthermore, if (a,b) # (0,0), 


—b 
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Thus, addition and multiplication in C have the same properties as the 
corresponding operations in R (or Q), with (0,0) and (1,0) playing the 
roles of 0 and 1, respectively. 

Consider complex numbers of the form (x,0). Addition and multipli- 
cation of two such complex numbers give 


(a,0) + (6,0) = (a + 6,0), 
(a,0) « (6,0) = (ab,0). 


This allows us to identify every complex number (.x,0) with the real 
number x and thereby treat R as a subset of C. Let us henceforth wnite x for 
(x,0). Then for any (a,b) € C, 


(a,b) = (a,0) + (0,1)(5,0) 
=a-+ ib where i = (0,1). 


Now i* = (0,1) -: (0,1) = (— 1,0) = —1. 
Thus, every complex number can be wnitten as a + ib, where a and b 
are real and i? = — |. This is the common notation for complex numbers. 
Let z=a+ibe€C, where a,b € R. Then a and BD are called the rea/ 
part and the imaginary part of z, respectively. The complex number 
z =a -— ib is called the conjugate of z. The nonnegative real number 
\z| = + Va? + b?2 is called the modulus of z. 


3 Fields 


Introductory algebra is concerned with the operations of addition and 
multiplication of real or complex numbers as well as the related opposite 
operations of subtraction and division. On the other hand, abstract alge- 
bra is concerned with deriving properties of an abstract system (such as 
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the systems of integers, real numbers, or complex numbers) from its 
axioms in a formal, rigorous fashion. The purpose of this section is to 
introduce the richest algebraic structure, for which the systems of rational, 
real, and complex numbers are concrete examples. 


Definition. A system (F, +, -), where F is a nonempty set and +,: are 
binary operations on F, is called a field if the following axioms hold: 


(i) The additive operation is associative: (x + y) + z=x+(y+t 2). 
(ti) F has a zero element 0 such thatx+0=x=0+4+ x. 
(iti) Every element x € F has a negative —x€F such that x+ 
(—x)=0=(—-x)+ x. 
(iv) The additive operation is commutative: x + y= y+ x. 
(v) The multiplicative operation is associative: (xy)z = x(yz). 
(vi) Fhas 1#0 such that x1 =x = 1x. 
(vii) Every element 0#x€ F has an inverse x~'€ F such that 
xx! = | = x7! 
(vii) The multiplicative operation is commutative: xy = yx. 
(ix) The multiplicative operation distributes over the additive opera- 
tion: x(y + Zz) = xy + xz. 


Examples of fields are the systems of rational numbers, real numbers, 
and complex numbers under the usual operations of addition and multi- 
plication. The system of integers Z is not a field under the usual operations 
because each nonzero integer is not invertible in Z. Z is an example of 
another algebraic structure —““commutative ring” — discussed later in the 
book. 

Note that although familiar statements such as 


(a) x0=0 
(b) (—x)y = —(xy) = x(— y) 
(c) x(y—z)=xy— xz 
are not included in the axioms, they can be easily deduced from them. For 
example, to prove (a) consider 
x0 = x(0 + 0) [axiom (11)] 
= x0 + x0 [axiom (1x)} 
By axiom (111), — (x0) € F. Thus, 
xO + (—(x0)) = (xO + x0) + (— x0) 
= x0 + (x0 + (—x0)) [axiom (1)}; 
so 0 = x0. 


Not every field is infinite. We close this section by giving an example of 
a finite field. 
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3.1 Example (integers modulo n). 
Let n be a fixed positive integer, and let 


Z,, or Z/(n) = {0,1,...,(2 — 1)) 


be the set of equivalence classes of integers mod n (Section 2, Chapter 1). 
We define addition and multiplication in Z, as follows: 


xty=xt+y, xy=xy, (1) 
x, YE Z,. Since x = x + kn for all k € Z, the question arises: If x = x’ 
and y= y’, isittruethat x + y= x’ + y’andxy= x’y’?; in other words, 
are addition and multiplication in (1) well defined? We proceed to show 


that this is indeed the case. 
Now 


x =x’ and y= y’ = n divides x — x’ and n divides y — y’ 
=> n divides (x — x’) +(y'— y’) 
=> n divides (x + }') — (x’ + y’) 
= (x+ y)=(x' +y’) 
=x+ty=x'+y’,  by(I). 


This proves that addition in Z, is well defined. 
Similarly, 


x =x’ and y= y’ => n divides (x — x’ y — y’) 
= n divides (xy + x’y’) — xy’ —x’y 
= (xy — x’ y’) + x"(y' — y) + (Xx! — x) 
= n divides xy — x’y’ 
=> xy= x’y’ — xy = xy 
proving that multiplication is also well defined. 

That addition satisfies field axioms (1)-(iv) for (Z,,+, °) 1s trivial. The 
zero element is 0, and the negative of x is (— x). It 1s also trivial that 
multiplication satisfies axioms (v), (vi), (viii), and (ix); but axiom (vii) 
need not hold. However, for n = p, p prime, we proceed to show that 
axiom (vii) does hold. 

Let 0 # x € Z,. This implies that px. So there exist integers u and v 
such that pu + xv = 1 (Theorem 1.5). But then 

I = put+xv=1= put xv= put xv= put xv=0u+ x0 
=Ou+xv=0+ xv=xv. 
Hence x—' = v. Therefore, (Z,,+, -) is a field with p elements. 


We shall return to the algebraic structure (Z,,+, -), where 7 1s any 
positive integer, later in Chapter 9, for an example of a “‘finite ring.” 


CHAPTER 3 


Matrices and determinants 


1 Matrices 


Matrices originated in the study of linear equations but have now ac- 
quired an independent status and constitute one of the most important 
systems in algebra. They are a rich source of examples of algebraic struc- 
tures, which we shall study later. 

As usual, for any positive integer n, the set (1,...,7} will be denoted by n. 
The cartesian product m X n is therefore the set 


m Xn = ((1,/){0 = 1,...,715 7 = 1,...,7}. 


Definition. Let F be a field and m and n positive integers. A mapping 
AmxXn—F 


is called an m X n matrix over F. 


Let A bean m X n matrix over F, and let (i,j) € m X n. Then A(i,/), the 
image of the ordered pair (/,/) under the mapping 4A, is called the (/,/) 
entry (or element) of the matrix A and is denoted by 4,, or a,,, the latter 
being more common. When the (/,/) entry of A is written @,,, we say that 
A =(a,,). 

An m X n matrix A over F and a p X g matrix B over F are said to be 
equal, written A = B, ifm = p,n= q, and a,, = 5,, for each (i,j) € m X n. 
If A isan m X n matrix, A is said to be of size m X n. (Note that it is really 
the ordered pair (m,n) that is the size.) 
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An m X n matrix is usually exhibited by writing the mn entries a,,in a 
rectangular array as shown below: 


Qi, &y2 Qin 
Qa, 422 Qn 
Ami am? a Qmn 


The list (n-tuple) of entries a;, ,@;2,...,2,, iS Called the ith row (or row i) of 
A, denoted by A,, and exhibited as (a;; aj. *** @j,). The list (m-tuple) 
Aj, 52; 5A my 1S Called the jth column (column /) of A, and denoted by A/. 
In keeping with the topography of the rectangular array, A/ is shown as 


ay, 
ar 


am 


Thus, A has mrows A,,...,A,,and 2 columns /',...,A”. The (i,j) entry in A 
iscommon to the /th row and the /th column. Anm X | matnxiscalleda 
column matrix (or a column vector). Similarly, a 1 X m matrix 1s called a 
row matrix (or a row vector). 

An n Xn matnix A is called a square matrix of order n, and the list of 
entrieS A,,,222,;..-,2,, 1S Called the diagonal of A. (A matnx that 1s not 
necessarily square is said to be rectangular.) If every entry in A that 1s not 
in its diagonal is zero, then A is called a diagonal matrix and denoted 
by diag(d,,...d,), where d;=a,, (= 1,..,.m. A diagonal matnx 
diag(d,,....d,,) is called a scalar matrix if d,; = d, = +++ =d,,. 

A square matrix A = (a,,) is called an upper triangular matrix if a,, = 0 
for all i > j, and strictly upper triangular if a,; = 0 for all 1 = j. A (strictly) 
lower triangular matrix is defined similarly. 

The set of all 7» X nm matrices over a field F is denoted by F”*". The set 
F"**" of all square matrices of order n over F is, for the sake of conve- 
nience, written F,,. Again, for the sake of convenience, the set F'™" of all 
1 X n matrices (row vectors) and the set F”*' of all n X< 1 matrices (col- 
umn vectors) over F are both written F” when the context makes the 
meaning unambiguous. 
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2 Operations on matrices 


In order to set up an algebraic system of matrices, we are going to intro- 
duce two binary operations, addition and multiplication, and a third 
operation called scalar multiplication. These three operations are in- 
duced, in a manner to be described hereafter, by the binary operations of 
addition and multiplication in the field F. 


Definition. Let A = (a,;) and B = (b,;) be two m X n matrices over F. The 
sum of A and B, written A + B, is them X n matrix C = (c,,) such that 


Ci = ay + by, (i,j)Em Xn. 


When the matrices are exhibited as rectangular arrays, the definition 
implies that 


Qi, *** Ayy, by, tt Btn Ay+by, +s Ay tb, 


Am ai amn Bini ce Bin Amy + On ate Amn + Omn 


Note that matrices A and B can be added if and only if they have the same 
size. 

The m X n matrix in which every entry is zero is called the m X n zero 
matrix (or null matrix) and is denoted by 0,,,.,. When the size of the 
m Xn zero matrix is clear from the context, we denote it simply by 0. 


Definition. Let A = (a,,) beanm X n matrix. The negative of the matrix A 
is the m X n matrix B = (b,,) such that 


bi, = —ai;, (i,j)Em Xn. 
The negative of A is written —A. That is, 


Ay, °°° Ary Ay, “*** Tay, 


ami ee amn amy a —~ Amn 


Next we define multiplication of matrices. 
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Definition. Let A = (a;,) and B = (b,;) be, respectively, m Xn andnX p 
matrices over F. The product AB is the m X p matrix C = (c,), where 


n 
Cy = > AyD, (i,j) & m X p. 
k=1 


Note that matrices A and B can be multiplied (in that order) ifand only 
if the number of columns in A is equal to the number of rows in B. 

Let us define the dot product of two n-tuples of elements of Fas follows. 
Given a = (d,,...,a,) and b = (b,,,...,b,), we define 


a: b=a,b,+ -*- +a,6,. 


Then the (i,j) entry in the product AB is the dot product of the 7-tuples 
(44, ,.--,4,) and (b,,,...,5,;)-that is, the dot product of the ith row in A and 
the jth column in B. Thus, we have the rule: Let A = (a,,;), B = (b,,), and 
C = (c,,) be their product AB. Then 


Cy = A; ° B/. 


The reader may also find it helpful to keep in mind the following, which 
displays the fact that the (i,/) entry in AB is the dot product of the ith row 
in A and the jth column in B. 


Let A and B be matrices. The products AB and BA both exist ifand only 
if, for some m,n, A and Bare m X nandn X m matrices, respectively. But 
then ABisanm X mmatrix and BA isann X n matrix; therefore, AB, B.4 
have the same size if and only if m = n. But even when A and B are both 
n X nmatrices, AB and BA are not necessarily equal. This is easily proved 
by the following example. Let 


[oof am a-[t oh 
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_|2 0 _jloi 
AB 3 4 and BA i i! 


Hence, AB # BA. This proves that multiplication of matrices is not com- 
mutative, except in the trivial case of 1 X | matrices. 


Then 


2.1 Theorem. Let 4, B, and C be matrices. 
(i) IfAand Bcan be multiplied, and B and C can be multiplied, then 
A(BC) = (AB)C. 
(ii) If A and Bcan be multiplied and B and C can be added, then 
A(B + C) = AB + AC. 
(iti) IfA and B can be added, and B and C can be multiplied, then 
(A + B)C = AC + BC. 
Proof. (i) Let A = (a;,), B = (6,;), C = (c,;) be m X n,n X p, p X q matri- 


ces, respectively. Then A(BC) and (AB)C are both mm X q matrices. Let 
(1,7) € m X q. Then the (/,/) entry of A(BC) is 


n Pp Pp n 
>» Dik ($ bac) = > ( abi) Cy 
ke! I=} ja \kmt 

= the (1,7) entry of (AB)C. 


This proves A(BC) = (AB)C. 
The proofs for (11) and (111) are similar to the one for (i) and are left as 
exercises. O 


The Kronecker delta function is defined to be 
0: NX N — (0,1) 
such that 
5, = {} WETS for inieN. 
ifix, 


It is worth noting that given an n-tuple (a,,....€,); 


n 
> a,0;j = a;. 
i=} 
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Definition. The n Xn matrix in which, for each (i,j)E a Xa, the (i,j) 
entry is 6,,, is called the identity matrix of order n and denoted by I,,. 


In other words, each entry in the diagonal of /, is 1, and every other 
entry is zero. Thus an identity matrix looks like 


] 0 eee 0 
O 1 = 0 
00 J 


When the order of the identity matrix /, is clear from the context, we drop 
the subscript n and simply write /. The following theorem justifies the 
nomenclature of the identity matrix. 


2.2 Theorem. Let A be anm X n matrix. Then 
[,A=A=Al,. 


Proof. Obvious. O 


Definition. Let A be ann X n matrix. If there is ann Xn matrix B such 
that 


AB = [ = BA, 


then A is said to be invertible and B is called an inverse of A. 


The inverse of a matrix A, if it exists, is denoted by A~'. 


Definition. Let A = (a,;) be an m X n matrix over F, and let a € F. The 
product aA (also written Aa) is the m X n matrix whose (i,j) element is 
aa, for all (i,j) € m X a. 


Remark. aA = diag(a,a,...,%)A, where diag(a,q,...,@) isan m X m scalar 
matnix. 


In the context of matrices over F, the elements of F are called scalars. 
Therefore the multiplication ofa matrix by an element of Fis called scalar 
multiplication. 

Like addition of matrices, scalar multiplication is also a pointwise (or 
component-wise) operation. By that term we mean that the (/,/) entry in 
aA is obtained by multiplying the (i,/) entry in A by a, and likewise, the 
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(i,j) entry in A + Bis obtained by adding the (i,/) entries in A and B. On 
the contrary, multiplication of matrices is not a pointwise operation. 
The following theorem gives properties of scalar multiplication. 


2.3 Theorem. Let A and B be n x n matrices over F, and let «, Be F. 
Then 
(i) a(A+ B)=aAt+aB. 
(ii) (a+ P)A=aA + BA. 
(tii) (a@B)A = a( BA). 
(iv) a(AB) = (aA)B = A(aB). 
(vy) 1A=A. 


Proof. Obvious. O 


Let e,,denote the n X n matrix in which the (i,/) entry is | and all other 
entries are 0. The n? matrices e,,, i,j = 1,...,, are called matrix units of 
order n. (The order n of the matrix e,, is usually known from the context 
and therefore not included in the notation.) It is clear that any n Xn 
matrix A = (a,,) can be written as 


n 
A= D> ayey. 
im] jwl 
The following result about the product of two matrix units will be used 


frequently. 


For any two matrix units e,,, e,, of order n, 
€1 pg = Dipl igs 
where 0, is the Kronecker delta. 


Definition. Let A = (a,;) be an m X n matrix. The transpose of A is the 
n Xm matrix B= (b,;), where 6, = a;,, (J) € a X m. 


The transpose of A is generally denoted by ‘4 and is obtained by 
interchanging rows and columns of A. 


2.4 Theorem. Let A and B be matrices over F and leta € F. Then 


() UA)=A. 
(it) aA) = a(‘A). 
(iii) IfA and B can be added, then 


(A + B)='A + ‘B. 
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(iv) IfA and B can be multiplied, then 


(AB) = ‘B'A. 
(v) IfA is invertible, then'A is invertible and 
(‘A)-' = (A~'). 


Proof. (i), (ii), (111) follow immediately from the definition. To prove (1v), 
let A = (a,,) and B = (b,,) be m X nand n X p matrices, respectively. Let 
‘A = (aj,) and '‘B = (b;,). Then aj, = a,, and bj, = bj. 

Suppose AB = (c,,). Then the (i,j) entry of (AB) is 


-> Gj Dki = > Oy by = >» bin Qy 
= (ii) entry of *BtA, 
The proof of (v) is left as an exercise. O 


Problems 


1. Prove Theorem 2.1. 
2. Prove Theorem 2.4. 


3 Partitions of a matrix 


Let A be an m X n matrix. If we delete any p rows and any qcolumns of A, 
the resulting (7m — p) X (n — q) matrix is called a submatrix of A. 

A matrix is said to be partitioned if it is divided into submatrices (called 
blocks) by horizontal and vertical lines between the rows and the columns. 
Sometimes the multiplication of two matrices A and B becomes simpler 
by partitioning them into suitable blocks. For example, let 


Qi, Q@y2 ayy Qi4 by, dy 
. B b,, by 

b;, by 

Qq, G42 Agy Qua 41 [a2 


be matrices that have been partitioned by the horizontal and vertical lines 
as shown. We can now consider A as the 2 X 2 matrix 


whose elements 4,; are the submatrices of A given by the partitioning. 
Similarly, we can consider B as the 2 X 1 matrix 


A 
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whose elements are the submatrices of B given by the partitioning. Then it 
is easily checked that AB is the same as the product 


4" 4u[[ Bu] = ' By, + Aba | 
Az, A) B,, A,B, + A,B, 

Multiplying two matrices A and B in this manner is called block multi- 
plication of A and B. The method of block multiplication is sometimes 
quite handy when multiplying large matrices. Of course, one must parti- 


tion the two matrices in such a way that the product of the blocks occur- 
ring in the process of multiplication is meaningful. 


4 The determinant function 


Let n be the set (1,2,...,2}. The set of permutations of n is denoted by S,,. 
For convenience a permutation a € S, is usually represented as 


] 2 eee n 
ao(1) o(2) *:: a(n)]' 
In this notation the first column expresses the fact that o maps | to o(1); 
the second column, that o maps 2 to o(2), and so on. 


Note that the order of the columns in this representation of o is imma- 
terial. For example, 


( 1 2 3 ‘) P (; 43 1 
2413) 3 43 12 
represent the same permutation. Furthermore, if in such a representation 
of o, acertain number, say /, 1s absent, it is understood that o keeps / fixed; 
that is, o(i) = i. For example, (4 2 }) will be considered as a permutation o 
of the set (1,2,3,4,5} such that o(3) = 3 and o(4) = 4. 

The product of two permutations of n is again a permutation of n. For 


example, if 
ne 1 23 4 — 1 2 3 4 
23 4 If’ 432 1 


are permutations of degree 4, then ot(1) = 1 because t maps | to 4 anda 
maps 4 to |; similarly, ot(2) = 4, and so on; thus 


_({1 234 
aaa eee a ee 


In the same way 


@a-(! 2 3 4 
32 1 4): 
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The inverse of a permutation of n is also a permutation. For example, if 
o= (3321), then 


which can also be written as (} ? 3 3). More generally, if 


(; 2 eee ") 
o= ° ® ° 
ty ly eee l,, 


is a permutation of degree n, then o~', the inverse of a, is the permutation 


F lo eee i, 
] 2 eee n ¥ 


Thus, ifo,t € S,, then ot,o~',t—' E S,. The set S, is called the symmetric 
group of degree n. 

A permutation that interchanges two elements and keeps others fixed 
is called a transposition. For example, 


which maps i — j and j — /and keeps other elements fixed, is a transposi- 
tion. We write it as (4) or, more usually, as (i j). Note that the inverse ofa 
transposition (i /) is (i /) itself. 

A permutation a of n is called a cycle of length k if o maps a set of k 
elements, say (a, ,@3,...,4;,), cyclically, keeping the remaining elements, if 
any, fixed; that is, 0(a,) = a2, o(a,) = @3,...,0(a,) = a,, and o(a) = aifa 
is not one of the a, ,...,a,. We write such a cycle as (a, a ... a,). A cycle of 
length 2 is a transposition. Observe that (a, a2 ... a;), (@2 G3... a; a;), 
(3 Ag... A, Az), ... (A, a; ... Ay_,) are the same cycle. 

It is easily seen that any permutation o of n can be expressed as a 
product of disjoint cycles, that is, cycles for which no two have any 
common element. For example, 


_(1 2345 6\_ 
o=(i oo G 4 (1 4\(2 65 3). 


Further, every cycle can be expressed as a product of transpositions, 
although not necessarily uniquely. For example, (4532 1)= 
(2 1X3 1X5 14 1). Thus, every permutation in S, can be expressed, not 
necessarily uniquely, as a product of transpositions. It can be shown that if 
a permutation is a product of r transpositions, and also of s transpositions, 
then r,s are both even or both odd (Theorem 2.1, Chapter 7). That is, the 
number of factors in any representation of o as a product of transpositions 
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is always even or else always odd. If a permutation is a product of an even 
number of transpositions, then it is called an even permutation, if it is a 
product of an odd number of permutations, it is called an odd permuta- 
tion. The identity permutation / of n is even because / = (i j)(j i), where 
i,j En. Also, ifa permutation a is even (odd), then o~' is also even (odd). 
Let o= mn, ... n,; then o-'=n,n_, ... n,, because the inverse of any 
transposition 7 is 7 itself. 

For any permutation o we define sgn o to be | or —1 according to 
whether o is even or odd. 


Definition. Let A =(a,;) be an n x n matrix over a field F. Then the sum 


> (SEN O) Ay 415°" Faetn) 


aéSn 


is called the determinant of A and denoted by det A or [A]. 


For example, consider a 3 X 3 matrix A = (a,,). S; has 3! = 6 elements 
given by 


sat 22 sesh ato Pe 

1 2 3) r™\2 3 4)? 2 (3 
al 28 ced 2 3 _(i 
ah2 1 3)? 471302 1)? [NY 


Of these, the first three are even, and the others are odd. Hence, 


WwW NO — ho 
NI ww 
Nee” 


det A = €,1€7233 + A,242343, + A1302,32 
~ A242, A353 ~ Ay3Q22A3, — Ay, A 23032. 


In the following sections we prove some results that enable us to evaluate 
the determinant of any square matrix in a less tedious manner. 

For readers familiar with the concept of a ring (for the definition see 
Chapter 9), it is of interest to note that the determinant of a matrix over a 
commutative ring 1s defined exactly in the same manner. Indeed, most of 
the results proved for determinants over fields also hold for determinants 
over a commutative ring. 


5 Properties of the determinant function* 


5.1 Theorem. Let A =(a,,) be an nxn matrix and let R;= 
(2; Qj2 +++ Ain) denote its ith row. Let R, =(a;, a. °°: a,,) denote a fixed 


* The authors would like to thank Professor R. N. Gupta for his suggestions on 
this section. 
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1 x n matrix. Then 


R, R, R, 
R, R, R, 
(i) det R, +R’ = det R + det R’ 
R, R,, R,, 
R, R, 
R; R; 
(ii) det aR, = adet R, 
R, R, 


(iii) If two rows of a matrix A are equal, then det A =0. 
(iv) det'A=detA 
Proof. (i) The left-hand side is 


Y (Sgn o)ay 4¢1)°°* (Aj 4¢ + Figtiy)"** Angin) 


céSn 


on » (SBN OF) Ay (1) °° Fiatiy*** Fnainy 


céESn 
A 
+ » (sgn T)A, 1) °°’ Biatiy’** Fon) 
aeSn 


R, R, 
=det | R; | + det | R; |, proving (i). 
R, R,, 
(11) Clear. 
(111) Suppose the ith and jth rows are equal. Suppose i <j. Let 
S={aeS,: a(i)<oa(j)}, T= {oeS,:a(i) > o(j)}. Let t=(i, j). 


The mapping @: S— T such that 0(c) = at 1s easily seen to be 
a 1—1 onto mapping. Thus 


|A|= 2, (sgn T)(G 1) °° Fietiy °F jot jy” Fnac 
CE 


—~ Bygiy’* Fioniy Fen jy* Fnatin)) 
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= >) (sgn O)V(G 1 o¢1)** Fiaciy °F jotjy °°” Inacn) 
oeéS 
come er, a Fig ;) ne F jai) ee Angin) 


a » (sgn a)(a 10(1) °° Pian” “Gia '* Anan) 


o€S 
= By gy" Fig jy °° Uiociy* Fnocny) (since ith row = jth row) 
= (). 
(iv) nl d, (sgn O)Qg¢1)1 4 5(2)2 °° Toinyn 
o€Sy 
= 3 (SQNG)Ay, 104)" Ang - (nm) 
oeS, 
e » (sgn a ")ayg- 11)" Bao - 4m 
ceSn 
= det A. 


5.2 Corollary. If a matrix B is obtained from a matrix A by inter- 
changing the ith and jth rows, then det B = — det A. 


Proof. 
R, 
R, +R; 
0 = det , by Theorem 5.1 (iii), 
R;+ R; 
R, 
R l R 1 R I R I 
R, R; R; R; 
= det ° + det . + det 3 + det ’ 
R; R; R; R; 
R, R, R, R, 
by Theorem 5.1! (i), 
= det B+ 0+0+det A. 


Thus det B= —detA 
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5.3 Corollary. [fo is a permutation of {1,...,n} and the rows of A 
are R,,...,R,, respectively, then 


Ry i) 
det | : = (sgn a) det A. 


Rain) 
R, 
Proof. Suppose a is a product of m transpositions. Then A = R can 
R, 


o(1) 
be transformed to : | by m interchanges of rows. Hence 


o{n) 
R, 
Rasy R 
det} : |=(—1)"det |? | =(sgno)det A, by Corollary 5.2 
Rainy R, 
5.4 Theorem. Let X, Y be n x n matrices. Then 


det(X Y) = (det X)(det Y). 
Proof. det (X Y) 


» (sgn o)4( y ri tuan)( » Sadao )“( » ‘ainYaew ) 
g€ESn iy = i2=1 inal 


(X15, X21, °° Xnin \( > (SQN O)3, 01) °° Sat) 


o€Sn 


1S iy,f2 erece ingn 


where i,,i,,...,i, may be assumed to be distinct because if i,,...,i, are 
not all distinct, then 
det(Y) = a (sgn O)Y;,6(1)'"' Vino = 9, by Theorem 5.1 (iii). 


oé€Sa 


Thus det (X Y) = 2, (X 5 x 1) °° 7 2, (sgn O)V ate)’ x Yamew 


= (Sgn T)X i e1)°°'Xneeny (det Y), by Corollary 5.3. 
1)" t(n) 


teSn 


Hence det (X Y) = (det X)(det Y). 
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Problems 


1. Show that if A =(a,,) is an n x n upper (or lower) triangular 
matrix, then det A =a, ,@,.°":,,. 
2. Evaluate the following determinants: 


1 2 3 3 | 4 
(a) 23 4}. (b) 0 1 0}. 
3 4 § 12 —-1l 
1 a a 
3. Prove that j1 5b b?| = (a—b)(b—c)(c —a). 
i c¢ c’? 


4. If A is an invertible matrix, prove det A7’=— — . 

det A 

5. If Aisasquare matrix such that A = A?, provedet A =Oor I. 

6. If A is a square matrix such that A*=0 for some positive 
integer k, then det A = 0. 


6 Expansion of det 4 
Let A be an 1 X nm matrix. Consider all the terms in the sum 


det A = > (= 1)*a 10(1)*-*2 nen) 
oE5S,, 
that contain a given entry a,,asa factor. They are given by those permuta- 
tions o that satisfy the condition o(/) = j. Therefore, the sum of all such 
terms is 


> (= 1) Ay o1y---Biatiy-+-Anotny = ayjAj;, 
o€ S, 
oti)=s 
where 
Ay= > (= 1K 10¢1)-+-@j—1 01 191+ 100+ 1) natn) (1) 
GES, 
ai)=j 
is called the cofactor of a,,in det A. Now 


det A = > (— 1) 191) notn) 


oES, 


=> DY (— 1k ay 041)---nainy: 


jml o€S, 
ai)=j 
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That is, 
det A=) a,Aj = ay Aq +°°1 + G),Ain- (2) 
j=l 
Similarly, 
det A = > > (— 1) ory Fact 
i=! o&€S, 
oti) 
= ¥) a,,Aj;- 
i=] 
That 1s, 
det A = a,,A,,+°°++ a,A,y. (3) 


(2) is called the expansion of det A according to the ith row. Similarly, (3) 
is the expansion of det A according to the jth column. 
Now consider the sum 27_,a,,A,,, p i. Substituting (1) for A,,, we get 


n n 

—1)k 
> ayA,; = » ap; >» (= 1) o¢1)---i— 1014-1) i+ 1.0141) Anata) 
jm 


j=! o€S, 

oti)=/j 

= — 1) 
> (—1) F 4.04 1)++ Fj — 1 ofi— 1) 9 port) Fit 101+ 1):+*F nota) 


o€S, 


=(0, by Corollary 5.6. 


Similarly, 27_,a,,A,; = 0 if p # j. 
Combining the last two results and (2) and (3), we get 


6.1 Theorem. Let A be ann X n matrix. Then 
(i) » apjAiy = Opi det A. 
j=1 
(ii) »y aipAij = Op; det A. 
i=1 


In the following we develop the method to evaluate the cofactors A ,, for 
the matrix A = (a,,). First we find A,,, the cofactor of a,,. By definition, 


A\,= > (— 1)§125(2)++Anotn)> (1) 
where a is a permutation of (2,3,...,2) and kK = 0 or | according to whether 


a is even or odd. 
The expression on the right side is the determinant of the (7 — 1) X 
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(n — 1)submatnix of A, which ts obtained by omitting the first row and the 
first column of A. To find the value of a general cofactor A,, of a;,, we first 
perform some row and column interchanges on A to bring a,, to the (1,1) 
position, so that we can apply (1). Thus, to bring a,, to the (1,1) position, 
we move the ith row up past row /— l,..., row 1, so that after ;— 1 
interchanges it becomes the first row. We next move the jth column to the 
left past the first j — | columns in turn, so that it becomes the first column. 
We now have a matrix that is obtained from 4 by / — | interchanges of 
rows and j — | interchanges of columns. So by Theorem 5.1! the determi- 
nant of the new matrix is (— 1)'*/ det A. The cofactor of a,, in this new 
determinant is the (7 — 1)th-order determinant obtained by omitting the 
new first row and first column, that is, the ‘th row and jth column of A. 
Thus, 


Aj; = (-— 1)f*s . det B, 


where B is the submatrix of A obtained by omitting the /th row and /th 
column of A. 
The foregoing discussion gives the following 


6.2 Theorem. Let A be ann X n matrix. The cofactor of a,, in det A is 
equal to (— 1)'*4 times the determinant of the submatrix of A obtained by 
deleting the ith row and the jth column. 


6.3 Example 
Let A = (a;;)bea3 X 3 matrix. If we expand det A by the first row, we get 
det A =a,,(—1)!*! Q22_ 23 + a. (—])it2 [421 423 
432 33 r— 1) Ay, 33 
+ a.(—1)'+3 | 421 422 
ii(— 1) ay, Ay 


= €14(422433 — 432423) — A,2(A2,433 — 43,423) 
+ €43(421432 — 23,22). 


If we expand det A by the third row, we get 


Qi2 Ay, Qy, a, 
a + a,,(— 1)3*? 3 


det A = a,,(— 1)'*3 
3(— 1) a 


Q22 


a a 
+ a3,(—1)3+3 | 2! 12] 
G42; 422 
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If we expand A by the second column, we get 


det A = a,(—1)'*2 492) 923) 4 gi (— pee [Ou 413 
ie I) Q3, 33 a(— IP A3, 433 
+ q.(— 1)3+2 Qi Gia] 
s(~ 1) Q2, 423 
Problems 
1. Prove the following result: 
1 aa 
1 b Bl =(b—cKc— aXa— bla+b+ Cc). 
1 ce 
2. Prove that 
a, a? eee an! 
1 a, ajo ag! 
= [[ (a,-a,). 
Isg<psn 
lL @y. 2, 8 ay 


This determinant ts known as the Vandermonde determinant. 
3. Prove that 


(b+cP = a? a? 
(a) b? (c + ay b? = 2abc(a+ b+ c)'. 
ce? cc? (a+ bp 
a-b-c 2a 2a 
(b) 2b b-—c-—a 2b =(at+b+c)’. 
2c 2c c-a-—b 


4. Prove that 


3 atb+c at+b?+c? 
(a) atb+c att+bh?t+c? ab+b34+c3 
Crh re Pepe arr 


1 a ari? 
=!1 b b*] =(a— by¥(b—c){c— a). 
1 ¢ c? 
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a bc}? 
(b) Ic a b 
bea 


2bc — a? c? b? 
= c? 2ca — b? a? 
b2 a? 2ab — c? 


a*—be b?—ca c*—ab 
=tc?—ab a*—bc b*—ca 
b?—ca c?—ab at—be 


= (a? + b? + c? — 3abc)’. 
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PART Il 


Groups 


CHAPTER 4 


Groups 


1 Semigroups and groups 


An algebraic structure or algebraic system is a nonempty set together with 
one or more binary operations on that set. Algebraic structures whose 
binary operations satisfy particularly important properties are semi- 
groups, groups, rings, fields, modules, and so on. The simplest algebraic 
structure to recognize is a semigroup, which is defined as a nonempty set S 
with an associative binary operation. Any algebraic structure S with a 
binary operation + or - is normally written (S,+) or (S,:). However, it is 
also customary to use an expression such as “the algebraic structure S 
under addition or multiplication.” Examples of semigroups are 


(a) The systems of integers, reals, or complex numbers under usual 
multiplication (or addition) 

(b) The set of mappings from a nonempty set S into itself under 
composition of mappings 

(c) Theset of nm X n matrices over complex numbers under multipli- 
cation (or addition) of matrices 


Let (S,-) be a semigroup and let a,b € S. We usually write ab instead of 
a: b. Anelement e in S ts called a left identity ifea=aforallaec S.A 
right identity is defined similarly. It is possible to have a semigroup with 
several left identities or several right identities. However, if a semigroup S 
has both a left identity e and a right identity {| then e = ef = £. Therefore, e 
is the unique two-sided identity of the semigroup. 
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Definition. Let (S, +) be a semigroup. If there is an element e in S such that 
ex=x=xe  /forallxES, 


then e is called the identity of the semigroup (S, -). 


Again, let S be a semigroup and let e be a left (right, or two-sided) 
identity in S. Leta € S. Anelement bin Sis called a left inverse of a (with 
respect to e) if ba = e, b is a right inverse if ab = e. An element in S may 
have several left inverses or several right inverses. However, if e is a 
two-sided identity and a has a left inverse b and a right inverse c, then 
b = be = bac = ec=c. 


Definition. Let (S,-) be a semigroup with identity e. Leta € S. If there 
exists an element b in S such that 


ab = e= ba, 


then b is called the inverse of a, and a is said to be invertible. 


Our objective in this chapter and the following four chapters is to give 
basic results and techniques in the theory of a somewhat richer algebraic 
structure called a group. A group may be defined as a semigroup with 
identity in which every element has an inverse (that must be unique). We 
now give an equivalent definition with weaker postulates. 


Definition. A nonempty set G with a binary operation - on G is called a 
group if the following axioms hold: 


(i) a(bc) =(ab)c for all a,b,c E G. 
(ii) There exists e€ G such that ea =a foralla€é G. 
(tii) For everya€ G there exists a’ © G such that a’a= €. 


Suppose (G, -) is a group according to this definition. Let a € G, let a’ 
be a left inverse of a, and let a” bea left inverse of a’. Then a’a = e=a"a’. 
Hence, 


aa’ = ead’ = a” a’‘ad’ = a” ea’ =a" a’ =e. 


Therefore a’ is also a right inverse of a. Now ae = aa’a = ea = a for every 
a€ G. Hence, e is also a right identity. This proves that G is a group 
according to the definition that it is a semigroup with identity in which 
every element has an inverse. 

Henceforth, unless otherwise stated, the binary operation in a group 
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will be written multiplicatively, and we shall denote the group (G,:) 
simply by G. The identity element is denoted by e or 1, and the inverse of 
an element a is written a~'. If the binary operation is written additively, 
the identity element is denoted by 0, and the inverse of an element a is 
written — a, called the negative of a. 

If the binary operation in a group (G,:) is commutative — that 1s, 
ab = ba for all a,b € G - then the group is called commutative or abelian. 

When a group A is abelian, its binary operation is generally wntten 
additively. 

Let (S,:) be a semigroup. Since - is associative, Theorem 4.2 (Chapter 
1) holds here, and therefore there is a unique product of any given ele- 
ments a,,...,a, € S, in that order, that are written a,qa,...a,. For any posi- 
tive integer n we define, as before, a” = II7_, a = aa...a (n-fold product), 
which is called the nth power of a. 

If S has an identity e, we also define - 


a® =e. 


Further, if a is an invertible clement in S, we define, for any positive 
integer 7, 


a"=(a"')", 


where a™! is an inverse of a. 

In the case of an additive semigroup (S,+), we speak of sums and 
multiples instead of products and powers. For any positive integer 1, and 
a € S, we define 


na=S,a=atat-:-+a _ — (n-fold sum). 


1.1 Theorem. A semigroup G is a group ifand only if for all a,b in G, 
each of the equations ax = b and ya = b has a solution. 


Proof. \f G is a group, then clearly the equations ax = b and ya = b have 
solutions x = a~'b and y= ba™', respectively. Conversely, let G be a 
semigroup and suppose ax = band ya = bhave solutions for all a,b € G. 
Let a € G. Then, in particular, the equation ya = a has a solution, say 
y=e, so that ea = a. Now for any 5 in G, the equation ax = b has a 
solution, say X = Xo, so that ax 9 = b. Therefore, eb = eaxy = ax, = b. 
Hence, ¢ is a left identity. 

Further, for every ain G, the equation ya = e has a solution. Hence, a 
has a left inverse. Therefore, G isa group. O 
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The following theorem gives a simpler characterization for a finite 
group. A group (or a semigroup) (G,°) is said to be finite or infinite 
according to whether the set G is finite or infinite. The cardinality of the 
set G is called the order of the group G and denoted by |G| or o(G). 


1.2 Theorem. A finite semigroup G is a group if and only ifthe cancel- 
lation laws hold for all elements in G; that is, 


ab=ac>b=c and ba=ca>b=c 
for all a,b,c E G. 


Proof. lf G is a group, the cancellation laws hold trivially. 

Conversely, let G be a finite semigroup and suppose that the cancella- 
tion laws hold. We shall prove that for all a,b in G, the equations ax = b 
and ya = b are solvable, and therefore, by the last theorem, G is a group. 

Let a,b € G. Let G = (a,,@3,...,a€,), where a,,...,a, are distinct. Con- 
sider the set H = (aa, ,aa),...,aa,). Since aa,€ G for each i = 1.,...,n, 
HCG. By the left cancellation law, aa, = aa, => a, = a,. Hence, H has n 
distinct elements, and therefore H = G. Hence, 6 € H and therefore 
aa, = b for some a, € G. This proves that the equation ax = bis solvable. 
Similarly, it follows from the right cancellation law that the equation 
ya = b is solvable. Hence, by Theorem 1.1, Gisa group. O 


1.3 Examples of groups 


(a) Integers modulo n under addition 


Let x,y € Z and let n be a positive integer. Define x = y (mod n) if 
x — y= qnfor some g € Z. Clearly, = is an equivalence relation. Denote 
by x the equivalence class containing x. Let Z/(n) denote the set of equiva- 
lence classes. We define addition in Z/(n) as follows: x + y=x + y. This is 
well defined, for if x = x’ and y = y’, then 2 then nix — x’ and n|y — y’; hence, 

n\(x + y) —(x’ + y’). Thus, x + y=x’ + y’. Clearly, 0 is the identity, and 
(— x ) is the inverse of x. Thus, Z/() is an additive abelian group of order 
n. Indeed, 


Z/(n) = (0,1,...,(n — 1)). 


(b) Integers modulo n under multiplication 


Consider the set Z/(n) as in (a). We define multiplication in Z/(n) as 
xy = xy. As proved for addition in example (a), we can prove that multi- 
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plication is well defined. Then (Z/(7), -) is a semigroup with identity, and 
the set (Z/(n))* consisting of the invertible elements in Z/(n) forms a 
multiplicative group of order @(n), where @ is the Euler function. 


(c) Permutations under usual composition 


Let X be a nonempty set, and let G be the set of bijective mappings on_X to 
X (i.e., permutations of X). Then G is a group under the usual composi- 
tion of mappings. The unit element of G is the identity map of X, and the 
other group postulates are easily verified by direct applications of results 
on mappings (see Chapter 1). 

This group is called the group of permutations of X (or the symmetric 
group on X) and is denoted as S,. If|X| = ,S, is a group of order n!. 


(d) Symmetries of a geometric figure 


Consider permutations of the set_X of all points of some geometric figures. 
Call a permutation o: X — X a “symmetry” of S when it preserves dis- 
tances, that is, when d(a,b) = d(o(a),o(b)), where d(a,b) denotes the dis- 
tance between the points a,b € X. If o,t are two symmetries, then 


A((at)(a),(ot(b)) = d(a(t(a)),0(t(b))) = d(t(a),t(b)) = d(a,b). 
Thus, ot is also a Symmetry. Further, if o is a symmetry then 
d(a~'(a),0~"(b)) = d(a(a~ "(a)),o(a~ '(b))) = d(a,b). 


So a~' is also a symmetry. Clearly. the identity permutation is a sym- 
metry. Hence, the set of symmetries of S forms a group under composi- 
tion of mappings. 

Let us consider a special case when_¥ is the set of points on the pernme- 
ter of an equilateral triangle: 


The counterclockwise rotations through 0, 27/3, and 47/3 are three of the 
symmetries that move the vertices in the following manner: 
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11 1-2 l1—3 
2-2, 2-3, #g.and 2-1, 
3—3 3— 1 3—2 


respectively. These are commonly written as 


{i 2 3 {1 2 3 2_({!1 2 3 

e-(1 3 3), e B20 a Ets 
(Note. Performing a rotation through 47/3 is equivalent to, or 1s a result- 
ant of, performing a rotation through 27/3 and then again through 27/3. 
This explains our symbol a? for the rotation through 47/3.) 


Three other symmetries are the reflections in the altitudes through the 
three vertices, namely, 


1-1 2—2 3—3 
2—> 3, 3— 1], and 1— 2. 
3—2 1—3 2—1 


These may be rewritten as 


1 23 123 123 


respectively, where the “product” is composition of mappings. 

Since any symmetry of the equilateral triangle is determined by its 
effect on three vertices, the set of six symmetries is a complete list of 
symmetries of an equilateral triangle. We denote this group by D,, called 
the dihedral group of degree 3. Since D, is a subset of S, and each has six 
elements, D, = S;. 

Similar considerations apply to any regular polygon of 7 sides. This 1s 
discussed later in Section 5. 


(e) Linear groups 


Let GL(n,F) be the set of m X n invertible matrices over a field F. Then 
GL(n,F) is a group under multiplication, called the general linear group 
(in dimension n). Consider the subset SL(n,F) of GL({n,F) consisting 
of matrices of determinant |. Let A,B € SL(n,F). Then det(AB) = 
(det AXdet B)=1, so ABE SL(n,F). Clearly, J,E€ SL(n,F). Also, 
det(A~'\det A) = det(/,) = 1 implies det(A~') = 1, so A~' € SL(n,F). 
Therefore, SL({n,F) is also a group under multiplication. 
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({) Direct product of groups 


Let G,,...G, be a family of groups. Then the cartesian product 
G, X ... X G, is a group under the “pointwise” binary operation 


(81 5+++sBn MB 2-0sBn) = (8181 0--sBn& nd» 


where g,,g; &G,, | s isn. Clearly, ife, is the identity of G,, then (e,,...,€,) 
is the identity of G, X ... X G, and (g7"',...,97, ') is the inverse of (2, ,...,2,)- 
Associativity of the binary operation is an immediate consequence of the 
associativity of the binary operations in each G,. 

This group is called the direct product of the groups G,, | s isn, and 
is also written as IT7_,G,. 

A finite group (G,*) of order n can be exhibited by its multiplication 
table, which is an n X n matrix whose (i,j) entry is a, * a,, where G = 
(2; ,...:2,)- 

For example, the multiplication table for the group (Z/(4),+) is 


2 
2 
3 
0 
1 


We observe that the elements in each row (column) are distinct. This is 
a consequence of the cancellation law, as we saw in the proof of Theorem 
1.2. 

The multiplication table for the group D, (=S;), the symmetries of an 
equilateral triangle [Example 1.3(d)], is 


a|a art ee ab ar ob 
a7 | a* e a arh b_ ab. 

b | 6b a*b abe a a 
abiab b ath a eee a 
a*b\a*bh ab b a ae e 


In the following examples we provide solutions to a couple of problems 
illustrating computational techniques. 
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1.4 Examples 


(a) If G is a group such that (ab)" = a"b" for three consecutive integers, 
then ab = ba. 


Solution. We have (1) (ab)" = a"b", (2) (aby'*! = a"*'b"*', and (3) 
(aby'*? = q"*26"t2, From (1) and (2) we get (a"b"Xab) = (aby't' = 
a"t'h"+!, This implies, by appropriate cancellations, b"a = ab". Simi- 
larly, from (2) and (3), we get b"*!'a = ab"*'. But then b"*'a = b(b"a) 
b(ab"), so ab"*' = bab". This gives ab = ba. 


(b) Let G be a group such that (ab)? = (ba)? for all a,b € G. Suppose G 
also has the property that x? = e, x € G implies x = e. Then G is abelian. 


Solution. Let a,b € G. Then, by hypothesis, 
(ab-'bP = (bab™'Y; 

that is, 
a*=ba’b-' = or = a*b = ba’; 


so, in other words, the square of every element in G commutes with every 
element in G. Set c = aba~'b—', and consider 


c? = ab(a~'b~'aXba~'b-') = ab(aa~2b- a bab") 
= ab(ab~'a-2a\ba~'b—') = ab(ab-'a-'\ba-'b-') 
= ab(abb-2a—'\(ba~'b—') = (ab\aba-'b-2\(ba~'b-') 
= (abXaba~'\b-'a~'b-') = (aby*(a~'b-' = (baP(a='b7' 
= @, 


Then, by hypothesis, c = e, and hence ab = ba. 


Problems 


1. Let A be the set of mappings from R to R. Show that (4,+) is a 
group under the usual addition of mappings. 

2. Let H be the subset of C consisting of the nth roots of unity. Prove 
that H 1s a group under multiplication. 

3. Let G be a group such that a? = e for all a € G. Show that G is 
abelian. 

4. Let Gbea group such that (ab)* = a2? for all a,b € G. Show that 
G 1s abelian. 
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5. Prove that the matrices 


sh [3 8h [ro oh ame [Fs 9] 


form a multiplicative group. (This group is known as the Klein 
four-group.) 

6. Prove that the following matrices form a nonabelian group of 
order 8 under multiplication of matrices: 


h(a So} £3 Ls 
PbS Sh Le 


Further, show that if 


o=x|! Of Ju}YI 0 p-| O. vO! 
017 0 —Vv-1~P v-1 of 

then a‘ = e, b? = a’, and b~'ab = a3. (This group is known as the 
quaternion group.) 

7. Let G be a finite semigroup. Prove that there exists e € G such 
that e = e?. 

8. Suppose G is a semigroup with the property that for each aeG, 
there is a unique a*eG such that aa*a =a. Prove 


(i) If e is an idempotent in G, then e* = e. 
(11) If a*x =a*, x, aeG, then x = aa’. 
(111) For all aeG, a*aa* = a* and a** =a. 
(iv) For all a,beG, x = (ba*)* is a solution of xb =a. 
(v) G is a group. 
9. Show that a group in which all the mth powers commute with 
each other and all the mth powers commute with each other, m 
and n relatively prime, is abelian. 


2 Homomorphisms 

Definition. Let G,H be groups. A mapping 
o:G—-H 

is called ahomomorphism if for all x.y © G 


P(xy) = HX) A()). 
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Furthermore, if @ is bijective, then @ is called an isomorphism of G onto 
H, and we write G = H. If@ is just injective, that is, 1-1, then we say that 
is an isomorphism (or monomorphism) of G into H. If@ is surjective, that 
is, onto, then is called anepimorphism. A homomorphism ofG into itself 
is called an endomorphism of G. An endomorphism of G that is both 1-1 
and onto is called an automorphism of G. 

Ifo: G — H is an onto homomorphism, then H is called a homomor- 
phic image of G; also, G is said to be homomorphic to H. If@: G > Hisa 
1-1 homomorphism, then G is said to be embeddable in H, and we write 
GGH. 


(Note that in the definition of homomorphism we have wnitten both 
the groups multiplicatively, though the actual binary operations in G and 
H may be different.) 


2.1 Examples 
(a) Let G and H be groups, and let e’ be the identity of H. The mapping 
6:G—H given by Ax)=e’ forallxEG 


is trivially a homomorphism. 

(b) For any group G the identity mapping i: G — G is an automor- 
phism of G. 

(c) Let G be the group (R‘*, -) of positive real numbers under multipli- 
cation, and let H be the additive group R. The mapping @: R* — R given 
by d(x) = log x is an isomorphism. 

(d) Let G be a group. For a given a € G consider the mapping 


1,:.G—7>G  givenby IJ{x)=axa™' forall xEG. 
Since 
I (xy) = axyaq' = (axa™')aya~') = 1{x)I(y), 
I, is a homomorphism. By the cancellation laws, 
axa”! =aya"'=>>x=y. 
Hence, J, is injective. For any x in G, 
x = a(a~'xa)a~'! = 1,(a~ 'xa). 
Hence, /, is surjective. Consequently, /, is an automorphism of G, called 


the inner automorphism of G determined by a. 


2.2 Theorem. Let G and H be groups with identities e and e’, respec- 
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tively, and let'h@: G— H be a homomorphism. Then 


(i) e)—e’. 
(ii) &(x7') = (O00)! for each x E G. 


Proof. (i) d(e)d(e) = d(ee) = d(e) = e’d(e). Hence, by the cancellation 
law, @(e) = e’. 


(11) Ax) h(x!) = H(xx7!) = H(e) = e’. Hence, A(x~') =(@())'. 0 


Thus, a homomorphism @: G — H not only “preserves” the group 
operation but also the group identity and the unary operation x > x7!. 


Definition. Let G and H be groups, andlet@:G — H beahomomorphism. 
The kernel of d is defined to be the set 


Ker @ = (x € Gid(x) = e’}, 


where e’ is the identity in H. 
Because @(e) = e’, it follows that Ker @ is not empty. 


2.3 Theorem. A homomorphism ©: G — H is injective if and only if 
Ker @ = (e}. 


Proof. Suppose @ is injective, and let x € Ker d. Then (x) = e’ = P(e). 
Hence, x = e. Therefore, Ker @ = (e}. 
Conversely, suppose Ker @ = (e}. Then 


G(x) = dy) => @(xy~!) = P(x)h(y)~! =e! 


=xy'e€Kergdex't=e>x=y.. 


Hence, @ is injective. O 


It can be easily proved that if f, G — H and g: H — K are homomor- 
phisms (isomorphisms) of groups, then the composite map gf: G > K is 
also ahomomorphism (isomorphism). Further, iff: G — H is an isomor- 
phism of G onto H, then the inverse mapping f—': H — G is also an 
isomorphism. We noted earlier that every group is isomorphic to itself. 
This proves that isomorphism of groups is an equivalence relation. 


Problems 


1. Let G be the multiplicative group of the mth roots of unity. Prove 
that G = (Z/(n),+). 


72 Groups 


2. Prove that a group G is abelian if and only if the mapping /. 
G — G, given by f(x) = x!" is a homomorphism: 

3. Show that a group G is abelian if and only if the mapping 
f:G—G, given by f(x) = x’, is a homomorphism. 

4. Find the kernel of each of the following homomorphism: 


(a) f:Z-Z,, given by f(x) =x. 
(b) f: G—Z,, where G is the quaternion group (see Problem 6, 
Section 1) and f(a)=0, f(b) = 1. 


5. Show that there does not exist any nonzero homomorphism of 
the group D, (or S;) (see Example 1.3(d)) to the group Z;. 
6. Let F bea field with two elements. Show that GL(2, F) ~ S;. 


3 Subgroups and cosets 


Definition. Let (G,-) be a group, and let H be a subset of G. H ts called a 
subgroup of G, written H <G, if H is a group relative to the binary 
operation in G. 


For any group G the singleton (e) and G itself are subgroups of G, called 
trivial subgroups. A subgroup // of G is said to be a proper subgroup if H/ # 
(e) and H #G. 

It is easily seen that the identity element of a subgroup of a group 
must be same as that of the group. 


3.1 Theorem. Let G be a group. A nonempty subset H of G is a 
subgroup of G if and only if either of the following holds: 


(i) Forallatbbe H, abE H, anda € H. 
(ii) Forallawbe H, ab“ EH. 


Proof. if H is a subgroup, (i) and (ii) are obviously true. Conversely, 
suppose fH satisfies (i). Then for any a€ H, a~'€ H. Hence, e= 
aa—' € H. Therefore, H is a subgroup. Next, suppose that H satisfies (ii). 
Let a,b € H. Then e= bb-' € H. Hence, b=! = eb=' € H. Therefore, 
ab = a(b—')—' € H, which proves that #/ is a subgroup of G. O 


There is a simpler criterion for a finite subgroup. 


3.2 Theorem. Let (G,-) bea group. A nonempty finite subset H ofG is 
a subgroup if and only ifab € H for all.a,b © H. 
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Proof. \f H is closed under -, then (H,:) is a finite semigroup. Since 
cancellation laws hold for all elements in G, they must hold for all ele- 
ments in H. Therefore, by Theorem 1.2, H is a group and, hence, a 
subgroup of G. The converse is obvious. 0 


For an example of a subgroup, consider the set nZ of all multiples of 7, 
where 7 is any given integer; that is, nZ = (nk|k € Z}. Then for all 
kk’ € Z, nk — nk’ = n(k — k’)E nZ. Hence, nZ is a subgroup of the 
additive group Z. 

More generally, let (G, : ) be any group, and let a € G. Let H be the set of 
all powers of a: H = {a*|k EZ). Then aa’)! = a®-* EH for all 
k,k’ € Z. Hence, His a subgroup of G. H is called the cyclic subgroup of G 
generated by a, written H = [a], and a is called a generator of H.1f Gis an 
additive group, then [a] = (ka|k € Z) is the set of all multiples of a. The 
subgroup Z in the previous example is thus the cyclic subgroup of Z 
generated by 7. A group G is said to be cyclic if G = [a] for some a in G. 

The next two theorems give further examples of subgroups. 


3.3 Theorem. Let @: G— H be a homomorphism of groups. Then 
Ker @ is a subgroup of G and Im @¢ ts a subgroup of H. 


Proof. Ker @ and Im @ are both nonempty. Let a,b € Ker d. Then 
d(ab—') = d(a)d(b)—' = e’e’ = e’ (where e’ is the identity of H). Hence, 
ab-' € Ker d. This proves that Ker @ is a subgroup of G. Next, let 
af € Im d. Then a = f(x), B = &(y) for some x,y € G. Hence, af—! = 
d(x)h(y)~' = d(xy—') € Im , which proves that Im @ is a subgroup of 
H. OQ 


Definition. The center of a group G, written Z(G), is the set of those 
elements in G that commute with every element in G; that is, 
Z(G) = (a E Glax = xa for all x € G}. 


3.4 Theorem. Zhe center of a group G is a subgroup of G. 


Proof. Since ex = x = xe for all x E G, e€ Z(G). Let a,b € Z(G). Then 
for all x € G, 


ab-'x = ab~'xe = ab™'xbb—' = ab~'bxb™! 
= aexb™' = axb™' = xab™'. 


Hence, ab~' € Z(G). Therefore, Z(G) is a subgroup of G. OG 
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Let us next see how new subgroups may be obtained from given sub- 
groups. Let H and K be subgroups of a group G. Then e€ H, e€ K. 
Hence, /1 K is not empty. If a,b € HN K, then ab-' & H, ab-'€ K. 
Hence, ab~' € HN K. Therefore, HN K is a subgroup of G. More gener- 
ally, a similar argument shows that the intersection of any number of 
subgroups of G is again a subgroup of G. But the union of subgroups H 
and K is a subgroup of G if and only if HC K or KC XH. For suppose 
HU K is a subgroup, but neither HC K nor KC H. Then there exist 
elements a@€H—K and b€ K—H. Now abe€HUK: hence, 
abe HUK. If abE H, then b= a~'abeE H, a contradiction. On the 
other hand, if ab € K, then a = abb—' € K, again a contradiction. 

A binary operation on a set S induces a binary operation on the power 
set P(S). Accordingly, for any subsets A,B of a group G, we define 


AB = {xy E GIX E A, yE B). 
(For an additive group G, we define 4 + B= {x + y|x E A, yE B).) 


3.5 Theorem. Let Hand K be subgroups ofa group(G,:). Then HK is 
a subgroup of G if and only if HK = KH. 


Proof. Let HK = KH. Since e = ee € HK, HK is not empty. Let a,b € HK. 
Then a = /h,k,, 6 = hk, for some h, ,h, € H and k, ,k, € K. Hence, 


ab“! = hk, ky'"hy' = hykyhz', 


where k,; = k,kz' € K. Nowk,hz'! € KH = HK. Hence, k,hz' = h,k, for 
some fh, € H, k, € K. Therefore, 

ab™' = hy hyk, = Ngky, 
where h, = h,h, © H. Hence, ab~' € HK. This proves that HK is a sub- 
group. 

Conversely, suppose that HK is a subgroup. Let a € KH, so thata = kh 
for some hE H, kE K. Then a~' = hA~'k~'€ HK. Hence, a€ HK. 
Therefore, KH C HK. Next, let b © HK. Then b=! € HK. Hence, 67! = 
h’k’ for some h’ € H, k’ € K. Therefore, b = k’~'h’-! © KH. Hence, 
HK C KH. This proves that HK = KH. 0 


It follows from the theorem that if G is an abelian group, then //K is a 
subgroup for any subgroups HK of G. 

The subgroups of a group G can be partially ordered by set inclusion. 
Let H and K be subgroups of G. Then H £2 K is the largest subgroup of G 
contained in both H and K, in the sense that if L is any subgroup con- 
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tained in both H and K, then LC HO K. If HK = KH, then HK is the 
smallest subgroup containing both H and K. For if M is any subgroup 
containing H and K, then hk € M forall h € H,k € K. Evenif HK # KH, 
we can still find a smallest subgroup containing both H and K. 

Let S be a subset of G. Consider the family ¢ of all subgroups of G 
containing S. That 1s, 


€ = {AJA is a subgroup of G, SC A). 


Since G is in €, € is not empty. 

Let M be the intersection of all subgroups A in €. Then M isa subgroup 
of Gand SC M. If M’ isa subgroup and SC M’, then M’ € @. Hence, 
MC M’. Therefore M is the smallest subgroup containing S. It is called 
the subgroup generated by S and denoted by [S]. If G = [S] for some 
subset S of G, then S is called a set of generators of G. (Trivially, the set G 
itself is a set of generators of the group G.) If Sis empty, then [S} 1s the 
trivial subgroup (e). If S is a finite set and G = [S], then G 1s said to be a 
finitely generated group. 

To go back to the previous case of subgroups H and K, we see that the 
smallest subgroup containing H and K is the subgroup generated by 
H U K, which is denoted by H V K. 


.3.6 Theorem. Let S be a nonempty subset of a group G. Then the 
subgroup generated by S is the set M of all finite products x,Xz...X, Such 
that, for each i, x,E Sorx;'ES. 


Proof. Clearly, SC M. For any two elements a = x,...x,, and b = ))...y, 
in M, 


ab“'= x, eee XV eee ys'EM. 


Hence, M is a subgroup of G. Let M’ be any subgroup of G containing S. 
Then foreach x € S,x € M’. Hence, x~' € M’. Therefore M’ contains all 
finite products x,-:-x, such that x,eS or x; 'eS, i=1,...,m. Hence, 
MC M’. This proves that M is the smallest subgroup containing S and, 
therefore, the subgroup generated by S. O 


If Sisa singleton (a), the subgroup generated by Sis the cyclic subgroup 
[a] = (a'|i € Z), as shown earlier. 


Definition. Let G be a group, and a€ G. If there exists a least positive 
integer m such that a” = e, then m is called the order of a, written o(a). If 
no such positive integer exists, then a is said to be of infinite order. 
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We now give some examples to illustrate this concept. 


(a) In the additive group Z,, o(2)=2, since 2+2=0. Further, 
o(3) = 4. 
(b) In the multiplicative group G = {xeC|x> =1}, the element 


w= a 2 has order 3. The order of iin G = {xeC|x* = 1} 


is 4. 

(c) Each element different from e in the Klein four-group (see 
Problem 5, Section 1) is of order 2. 

(d) In S3, the order of (} ? 3)is 2 and that of(} 2 +) is 3. 

(e) The order of any element of a finite group G is finite. For, if 
aeG, then the elements a, a’, a°,... of G cannot all be distinct. 
So there exist distinct positive integers i,j such that a‘ = a/, 
which gives a'~/ = e. Hence o(a) is finite. 

(f) The order of any nonzero element in the group (Z, + ) is infinite. 
(There exist infinite groups in which each element is of finite 
order. Find one.) 


3.7 Theorem. Let G be a group anda€ G. 


(i) Ifa" =e for some integer n ¥ 0, then o(a)|n. 
(ii) If o(a)=™m, then for all integers i, a' = a™, where r(i) is the 
remainder of i modulo m. 
(iti) [a] is of order m if and only if o(a) = m. 


Proof. (i) If a" = e, then a~" = e. Hence a! = e for some i > 0. Therefore 
by the well-ordering property of N, there is a least positive integer m = 
o(a) such that a” = e. By the division algorithm, n = mq+r,0sr<m. 
Hence, e = a" = (a™)*a’ = a’. Therefore r= 0 and m|n. 

(ii) Again by the division algorithm, for any iG Z, i=mq+r, 
0=r<_™m. Hence, a! =a’, where r = r(i) is the remainder of i modulo m. 

(ii1) Let o(a) = m. Then e,a,...,a"~' are distinct, for otherwise a! = a/ 
for some i,j, OS i<jsm-— 1. Hence, a/~‘=e, a contradiction. Let 
H = [a] be the cyclic subgroup generated by a. For any i € Z, a! = a"), 
This implies that H has exactly m elements e,a,...,a~ !. 

Conversely, suppose that H is of finite order. Then a’’s are not distinct 
for all i € Z. Hence, a‘ = a/forsome i, j € Z, i < j. Then a/—! = e. Hence, 
a is of finite order, say m. But then H has exactly m elements, as proved 
earlier. O 


38 Corollary. [fG is a finite group, then there exists a positive integer 
k such that x* = e for all x € G. 
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Proof. Since G is finite, the cyclic subgroup [a] is also finite. Hence, 0(a) is 
finite, say n(a). Choose k = I,_gn(a). Then x* =e forallxEG. O 


Definition. Let H be a subgroup of G. Given a € G, the set 
aH = (ah|heE H) 


is called the \eft coset of H determined by a. A subset C of G is called a left 
coset of H in G ifC = aH for someainG. The set of all left cosets of H inG 
is written G/H. 


A right coset Ha is defined similarly. The set of all right cosets of Hin G 
is written H\G. 

Let us confine our attention for the present to left cosets aH of H in G. 
For any a € G the mapping f: H — aH, given by {(h) = ah, is clearly a 
bijection. Hence, every left coset of H has the same cardinality as H. Note 
that H itself is a left coset of H, since eH = H. 

Consider the relation ~ on G, where a ~ b means a~'b € H. For all 
a,b,c E G, 


a'a=eEH, 
abe H=>b'a=(a'by' EH, 


and 
a~'b, b-'cE H=> a7'c = (a~'bY(b7'c) EH. 


Hence, ~ is an equivalence relation on G. It is easily verified that the 
equivalence classes of ~ are precisely the left cosets of H in G. Therefore, 
the set G/H of left cosets of H in Gis a partition of G. That is, the distinct 
left cosets of H are pairwise disjoint and their union is G. 

Consider now the right cosets of H in G. By arguments similar to those 
given above, we can prove that any two night cosets of H have the same 
cardinality, and the set H\G of all right cosets of His a partition of G. Two 
elements a,b in G belong to the same right coset if and only if ab—' € H. 

Consider the mapping yw: G/H — H\G given by aH > Ha™'. The 
mapping is well defined because 


aH = bH=>a'bEe H=>a'(b')'€C H=>Ha'= Hod". 


By a similar argument, Ha! = Hb-' = aH = bH. Hence, w is injective. 
Moreover, yw is obviously surjective; therefore y is bijective. Conse- 
quently, G/H and H\G have the same cardinality. Thus, we have the 
following definition. 


Definition. Let H be a subgroup of G. The cardinal number of the set of 
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left (right) cosets of H in G is called the index of H in G and denoted by 
[G:H]. 


If #7 is the trivial subgroup (e}, each left (right) coset of H in G isa 
singleton. Hence, the index of H is equal to the order of G. Writing | for 
the trivial subgroup {e}, we get |G| = [G: 1]. 

For each nonzero subgroup K of the additive group Z, [Z:K] is 
finite. Indeed it is easy tc check that K = nZ, n £0, and so Z/nZ = Z,, 
(Example 1.3(a)). 

Let G be a finite group, and let #/ be any subgroup of G. Let |G| = n, 
| H| = m. Then every left coset of H has m elements. Since the distinct left 
cosets of #/ are pairwise disjoint and their union is G, we must have 
n= km, where k is the number of left cosets of H in G. In other words, 


(G:1)=[(G:A)(H: 1). 


This proves the following theorem, which is of fundamental importance 
in the theory of finite groups. 


3.9 Theorem (Lagrange). Let G be a finite group. Then the order of 
any subgroup of G divides the order of G. 


We shall now derive some important results from Lagrange’s theorem. 


3.10 Corollary. Let G be a finite group of order n. Then for everya E G, 
o(a)|n, and, hence, a" = e. 

Consequently, every finite group of prime order is cyclic and, hence, 
abelian. 


Proof. Let a © G. By Lagrange’s theorem the order of the cyclic subgroup 
[a] divides n. So o(a)jn. 

If nis prime and a ¥ e, the order of [a] must be 2. Hence, [a] = G, and 
therefore G is cyclic. O 


3.11. Examples 


(a) (Groups of order <6) 


It follows from Corollary 3.10 that groups of orders 2,3, or 5 are cyclic. 
Consider a group G of order 4. If G has an element a of order 4, then 
[a] = G. Hence, G 1s cyclic. Otherwise, every element # e is of order 2; 
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that is, G = (e,a,b,c) and a? = b? = c? = e. Consider the product ab. If 
ab = e, then ab= aa, which implies b= a, a contradiction. Hence, 
ab # e. Similarly, ab # a, ab ¥ b. Hence, ab = c. By the same argument, 
ba = c, bc = a= cb, and ca = b = ac. Hence, Gis abelian. (This group, or 
any group isomorphic to it, is called the Klein four-group.) Thus, all 
groups of order <6 are abelian. 


(b) (Euler - Fermat theorem) 


If a is an integer prime to the positive integer m, then a” = | (mod mm), 
where @ is the Euler function. 


Solution. Note that an element x € Z/(m) is invertible if and only if 
(x,m) = 1, and the order of the multiplicative group (Z/(m))* of the inver- 
tible elements in Z/(m) is ¢(m). Since (a,m)= 1, a€ (Z/(m))*, so 
qh) = 1. This yields a&™ = | (mod m). 


(c) (Poincaré’s theorem) 


The intersection of two subgroups of finite index is of finite index. 


Solution. Let H,K < G, H,K of finite indices. Let a € G. Then it is trivial 
that (HM K)a = HaN Ka. Thus, any right coset of H N K is the intersec- 
tion of a right coset of H and a right coset of K; but the number of such 
intersections is finite. This proves that the number of right cosets of HN K 
is finite. 


(d) 
Let G be a group and a, beG such that ab = ba. If o(a) = m, o(b) = n, and 
(m,n) = 1, then o(ab) = mn. 
Solution. Let o(ab) = k. Then (ab) = e. Consider 
(ab)"™ = a™b™ = @, 
which gives kjmn (Theorem 3.7). Also 
(ab)k=e=> akbt=e>ak=b <. 
Thus, 
o(a*) = 0(b-*) = 0(b*) (Problem 10). 
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But 
a™ = e=>(a*)" = e= 0(a*)|\m (Theorem 3.7). 


Similarly, 0(b*)|n. Therefore, o(a*) divides (m,n) = 1. Hence, o(a*) = 1, 
so a* = e, But then m|k. Similarly, njk. Consequently, mn|k. Since k|mn, 
it follows that k = mn. 

The result contained in the following problem is sometimes quite 
handy when studying structure of finite groups. 


(e) 
Let G be a finite group, and S, 7<G. Then 
_ (SIT! 
IST ISN TI] 


Solution. Consider S X T and definea relation ~ in S X T by (s,t) ~ (s’,t’) 
if and only if s’=sa and t’= a~'t for some aE SN T. Then ~ is 
an equivalence relation. Let (s,t) denote the equivalence class of 
(s,t) & SX T, and let S X T/~ be the set of equivalence classes. It follows 
from the definition of equivalence relation that 


(st) = ((sa,a~"t)]aE SOT). 
Thus, 
(s,¢)| = [SA TI. 


Also, let S X T = U4, (s,,¢,) be the disjoint union of equivalence classes. 
Therefore, 


ISX T)=k|ISN TI. (1) 


We now define a mapping f: S X T/~ — ST by fi (S,,t, ) = 5,t,. fis well 
defined and injective, because 


(S:.t,) = (St) 55a, =a 't, 
forsome aE SN TS 5,t, = 5;t,. 
Also, fis clearly surjective. Hence, 
[SX T/~| =|S7|. 
Therefore, k =|ST]|; so from (1) we get 
ISX Tl|=|ST |S 7, 
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which gives 


SIT 
ISN T| 


[S| 7] =(ST USA TI or |ST| = 


Problems 


1. 


Let H be a subgroup of a group G. Prove that Ha = H if and 
only if ae H. Also show that HH = H. More generally, if A is a 
nonempty subset of G then show that AH = H if and only if ACH. 
Let G be a group and H <G. Suppose |G/H|= 2. Prove that 
aH = Ha for all a€& G. 

Let G be a group and H <G. Show that for all xeG, x7 'Hx is 
a subgroup of G of the same cardinality as that of H. 

Find the subgroups of the groups given in Problems 5 and 6 of 
Section |. 

Show that for any subgroup of a group the inverses of the ele- 
ments of a left coset form a right coset. 

Let V be the group of vectors in the plane, with vector addition as 
the binary operation. Show that the vectors that issue from the 
origin O and have endpoints on a fixed line through O form a 
subgroup. What are the cosets relative to this subgroup? 

(a) If H and K are subgroups whose orders are relatively prime, 
prove that HNK = {e}. 

(b) If H and K are subgroups of order p and n, respectively, 
where p is prime, then either Hn K = {e} or H < K. 

Show that the elements of finite order in any abelian group form a 
subgroup. 

Show that a subset A of a group G cannot be a left coset of two 
distinct subgrov ps of G. If A is a left coset of some subgroup of G, 
then prove that 4 is also a right coset of some subgroup of G. 
Show that o(a) = o(a~') for any element a in a group G. 

If a,b are any two elements in a group G, show that aband ba have 
the same order. 

Let G be a group and a,b € G. Show that o(a) = 0(b7 'ab). 

Ifa group G has only one element a of order n, then a € Z(G) and 
n= 2. 

Let G bea groupanda,b € Gsuch that ab = ba. Let o(a) = mand 
o(b) = n. Show that there exists an element c € Gsuch that o(c) is 
the least common multiple of m and n. 

Show that if G is a group of even order then there are exactly an 
odd number of elements of order 2. 
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16. Prove Wilson’s theorem that, for any prime p, (p— 1)! =—1 
(mod p). 

17. Let abe an element of a group G such that o(a) = r. Let m be a 
positive integer such that (m,r) = 1. Prove that o(a”) = r. 

18. Let abe an element of a group G such that o(a) = r. Let m be a 
positive integer. Prove that o(a™) = r/(m,r). 

19. Let g: G— G’ bea homomorphism of groups. Let a € G. Prove 
that o(g(a))|o(a). Further prove that if g is injective, then 
0(g(a)) = o(a). 

20. Let G bea finite group, and let S, JT be nonempty subsets of G 
such that G # ST. Prove that |G| =|S|+]7]. 

21. Let S be a nonempty subset of a group G, and let 


C(S) = (x € G\|xs = sx, for all s € S). 


Show that C(S) < G. What is C(G)? (C(S) is called the cen- 
tralizer of S in G.) 
22. Find the centralizer of each of the following subsets of S,: 


{(1 2 3)}, {C1 2)}, and {(1 2 3), (1 2)}. 


4 Cyclic groups 

As already defined, a group G is said to be cyclic if G = [a] = (a‘|i € Z) for 
some a € G. The most important examples of cyclic groups are the addi- 
tive group Z of integers and the additive groups Z/(n) of integers modulo 
n. In fact, these are the only cyclic groups up to isomorphism. 


4.1 Theorem. Every cyclic group is isomorphic to Z or to ZKn) for 
some nEN. 


Proof. If G = [a] is an infinite cyclic group, consider the mapping y: 
Z — Ggiven by wi) =a’. Itis clear that y is a surjective homomorphism. 
Moreover, i # j= a' # a/, for otherwise a would be of finite order. 
Hence, y is injective. Therefore, y is an isomorphism. 

Next, suppose G = [a] is a cyclic group of finite order n. Then G = 
(e,a,...,.a"—') and o(a) =n (Theorem 3.7). Consider the mapping y: 
Z/(n) — G given by yi) = a‘. y is well defined and also injective, for let 
i,t € Z/(n). Then 


i=jerni-jeadi=eesa=ai. 
Clearly, y is surjective. Further, 
Witj)=wits)=a't =a‘. 


Hence, y is an isomorphism. O 
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The following is immediate from the above theorem. 


4.2 Theorem. Any two cyclic groups of the same order (finite or infi- 
nite) are isomorphic. 


4.3 Theorem. Every subgroup of a cyclic group is cyclic. 


Proof. Let G = [a] be a cyclic group, and let H be a subgroup of G. If Hisa 
trivial subgroup, the result is obvious. So let H be a proper subgroup of G. 
Ifa! € H, then a~! € H. Hence, there is a least positive integer 7 such that 
b =a™ © H. Weprove that H =[b]. Let a‘ € H. By the division algonthm 
i= mq +r, where 0 s r< m. Then 


a=aqi-™= q'b-'E H. 


Hence, r = 0. Therefore, a! = b*, which proves that H = [5]. O 


The converse of Lagrange’s theorem is not true in general, but the 
following theorem shows that it does hold for cyclic groups. 


4.4 Theorem. Let G be a finite cyclic group of order n, and let d bea 
positive divisor of n. Then G has exactly one subgroup of order d. 


Proof. The result holds trivially if d= 1 or n. So let 1 <d <n and put 
n/d = m. Let G = [a]. Then b = a™ is of order d. Hence, [5] is a cyclic 
subgroup of order d. To prove the uniqueness, let H be any subgroup of G 
of order d. By Theorem 4.3, H is generated by anelementc =a‘. Then 
a*™4 = c4 =e. Hence, n|sd, that is, md|sd and so mls. Let mA = s. This yields 
[a*] <[a”] = [5]. But since each of the subgroups [a*} and [a™] is of 
order d, [a*] = [a]. This proves H = [b). 


4.5 Example 
Let H = [a] and K = [5] be cyclic groups of orders m and n, respectively, 
such that (m,n) = 1. Then H X K is a cyclic group of order mn. 


Solution. Let o(a,b) = d. Now (a,b)™ = (a™,b™) = (e,e) implies djmn. 
Also (e,e) = (a,b)? = (a%,b*) implies a? = e = b*, so mid and nid. There- 
fore, mn|d. Consequently, mn = d. Since |H X K|= mn, it follows that 
(a,b) generates the group H X K. 


Problems 


1. Prove that the multiplicative group of the nth roots of unity 1s a 
cyclic group of order n. 
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2. Find all possible sets of generators of the subgroups of orders 
3,4, and 12 of Z,,. 

3. Prove that there are exactly two elements of an infinite cyclic 
group G that can generate it. 

4. Prove that there are exactly ¢@(n) elements of a cyclic group 
of order n that can generate it. [ Hint: If a is a generator of 
(Z/(n), +), then 1 = md for some me Z, so ma = 1 (mod n), which 
yields (a, n) = 1.] 

5. Show that a group G has no proper subgroups if and only if it 
is a cyclic group of prime order. 

6. Show that every finitely generated subgroup of (Q, + ) is cyclic. 
Show also (Q, +) #(Q”,:), where Q* represents positive 
rational numbers. 

7. Find a homomorphism from S, onto a nontrivial cyclic group. 

8. Show that Z x Z is not cyclic. 

9. Is Z, x Z, cyclic? 

0. Give an example of a nonabelian group such that each of its 

proper subgroups is cyclic. 


5 Permutation groups 


Definition. Let X be a nonempty set. The group of all permutations of X 
under composition of mappings is called the symmetric group on X and is 
denoted by S,. A subgroup of S, is called a permutation group on X. 


It is easily seen that a bijection X = Y induces in a natural way an 
isomorphism S;y= Sy. If |X|= 7, Sy is denoted by S, and called the 
symmetric group of degree n. 

A permutation o € S, can be exhibited in the form 


( I ae ) 

(1) of2) °** on)/’ 

consisting of two rows of integers; the top row has integers I,...,7, usually 
(but not necessarily) in their natural order, and the bottom row has o{/) 
below i for each i = 1,...,7. This is called a two-row notation for a permu- 
tation. There is a simpler, one-row notation for a special kind of permuta- 
tion called a cycle. 


Definition. Let a € S,,. If there exists a list of distinct integers x, ,...,X, © n, 
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such that 


O(X;) = X41, = Laat =I, 
O(X,) =X, 


o(x) = x if x ¢ (x, sug Xe) 


then is called acycle of length r and denoted by (x,...x,). A cycle of length 
2 is called a transposition. 


In other words, a cycle (x,...x,) moves the integers x, ...,x, one step 
around a circle as shown in the diagram (for 7 = 5) and leaves every other 
integer in n unmoved. (If o{x) = x, we say o does not move x.) 


Trivially, any cycle of length | is the identity mapping e. Note that the 
one-row notation for a cycle does not indicate the degree n, which has to 
be understood from the context. 

Two permutations o,t € S, are said to be disjoint if they do not both 
move the same integer; that is, for all x € n, o(x) = x or t(x) = x. 

For the sake of illustration, let us consider the symmetric group S;. 
There are six permutations of the set {1,2,3} given by 


o=(! 4 sh 2:3 =(} 

! 3h lo 3 1p 27 \3 
3 _({i 23 _(i 
2h 2™\3 2 tp Bho 


On examination we find that they are all cycles; that is, 


e=()), o,=(1 2 3) o=(1 3 2), 
tT =(2 3) H=G 1), T=(1 2). 


We can find the product of any two permutations by the usual rule for 


WN N NN 


ww) Wo 
—— 
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composition of mappings. Thus, 


_{1 2 3 
0, = 3 y) | = T>. 


(Recall that in the product to, o acts first, followed by t.) We can, how- 
ever, give a simpler description of the multiplication structure in S;. (This 
is not true for symmetric groups of degree > 3.) 
Let us write o foro, =(1 2 3) and t for t, =(2 3). We easily verify that 

o? = 0,,0° =e, t? =, OT = 13, o*T = T, = to. Hence, 

S; = (€,0,07,1,0T,07T), 
and the product of any two elements in S; can be computed by using the 
relations 

oe=e=t*?, to=oa't (*) 


and the associative property of multiplication. 

Relations of the type (*) that completely determine the multiplication 
table of the group are called its defining relations. We shall later see further 
examples of these relations. 

A subgroup of a symmetric group Sy 1s called a permutation group. In 
other words, a permutation group is a group (G,: ) such that the elements 
of G are permutations of some set X, and : is composition of mappings. 
The following theorem indicates the importance of permutation groups. 


5.1 Theorem (Cayley). Every group is isomorphic to a permutation 
group. 


Proof. Let G be a group. For any given a € G, the mapping 
f,:G—G, givenby (/f(x)=ax forallxEG, 


is a bijection, because ax = ax’ =>x=x’ and y= f,(a~'y) for all 
x,x’,y € G. Consider the mapping 


o:G—S,g  givenby d¢(a)=/f, forallaceG, 
where S, is the symmetric group on the set G. For all a,b,x € G, 
Saslx) = abx = fbx) = SAL) = (SafoX). 
Hence, 
dab) = p(a}p(d). 


Therefore, ¢ is a homomorphism, and Im @ is a subgroup of Sg. More- 
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over, 
d(a) = d(b) => ax = bx forall xe Gma=b. 


Hence, @ is an injective homomorphism. Therefore, G is isomorphic to a 
subgroup of S;. O 


The above isomorphism is called the left regular representation of G. 
Similarly, we have a right regular representation. 


Groups of Symmetries 


We now describe an important class of permutation groups known as 
groups of symmetries. Let X be a set of points in space, so that the distance 
d(x,y) between points x,y is given for all x,y € X. A permutation o of X is 
called a symmetry of X if 


d(o(x),o(y)) = d(x,y) forall x,yE X. 


In other words, a symmetry is a permutation that preserves distance 
between every two points. 

Let 7, denote the set of all symmetries of X. Then for all 0,1 € T, and 
xyEX, 


d(ta~ '(x),ta~ '(y)) = d(a™ '(x),07 "(y)) = d(aa '(x),a07 "(y)) 
= d(x,y). 


Hence, tao—! € T,. This proves that 7, is a subgroup of S, and, therefore, 
is itself a group under composition of mappings. It is called the group of 
symmetries of X. 

Consider, in particular, the case when_X is the set of points constituting 
a polygon of n sides in a plane. It is clear from geometrical considerations 
that any symmetry of X is determined uniquely by its effect on the vertices 
of the polygon. Therefore we need consider only the symmetries of the set 
of vertices, which can be labeled as 1,2,....2. Thus, the group of symme- 
tries of a polygon of n sides is a subgroup of S,,. 


Definition. The group of symmetries of a regular polygon P,, of n sides is 
called the dihedral group of degree n and denoted by D,,. 


The particular case 1 = 3 was considered in Example 1.3(d). 

Let us now consider the general case of a regular polygon P,, (see the 
figure). It is clear that a permutation o € S, isasymmetry of P, ifand only 
if o takes any two adjacent vertices of P,, to adjacent vertices; that is, ifand 
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only if o(1),0(2),...,o(m) are either in the cyclic order 1,2,....7 or in the 
reverse cyclic order n,n — 1,...,2,1. Thus, the symmetries of P, can be 
classified into two kinds: those that preserve the cyclic order 1,2,...,n, and 
those that reverse the order. Let o be a symmetry that preserves the cyclic 
order. Now o(1) can have any one of the values I,...,72, and once o(1) is 
fixed, o(2),...,0(m) are uniquely determined by the cyclic order. Therefore, 
there are exactly 7 symmetries that preserve the cyclic order of the ver- 
tices. Let us call them ¢, ,o2,...,0,, where o; is the symmetry taking | to i. 
By the same argument there are exactly m symmetries 1, ,T2,...,T, that 
reverse the cyclic order, where 1; takes | to 7. This proves that the dihedral 
group D,, has 2n elements g,,t1, (i = 1,...,”). 

We now give a simpler description of the elements of D, that enables us 
to write the product of any two elements easily. Clearly, o, is the identity 
permutation e, and a, isthe cycle(1 2...n). Writing o for a2, we easily 
verify that a‘ takes | toi + 1(i = 1,2,....2 — 1), and o“ takes | to 1. Since a’ 
(i > 1) preserves the cyclic order, it follows that 


a = 0,4), i= 1,..,2— 1, 
o" =e. 


Hence, the m symmetries that preserve the cyclic order of vertices are a! 
(i=0,1,...,2—1), where o is the cycle (1 2...n). 
Writing t for 


fe 12 « 2A” 
i ] n eee 2 ’ 


we clearly see that o't reverses the cyclic order and takes | toi + 1. Hence, 
o't = 1,4, (i= 0,1,...,2 — 1). Moreover, t? takes | to 1 and preserves the 
order. Hence 1? = e. 

Now consider the product to. Since to reverses the cyclic order, and 
ta(1) = t(2) = n, we conclude that to = o”~'t. We sum up the above 
discussion in 


5.2 Theorem. The dihedral group D,, is a group of order 2n generated 
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by two elements a,t satisfying o" = e = t? and ta = o"~'t, where 


1 2 ove n 
o=(1 2... n), r=(I a sake " 


Geometrically, o is a rotation of the regular polygon P,, through an 
angle 27/n in its own plane, and t is a reflection (or a turning over) in the 
diameter through the vertex |. 

Definition. The dihedral group D, is called the octic group. 


As an example of the group of symmetries of a nonregular polygon, we 
shall consider the symmetries of a rectangle in Example 5.3. 


5.3 Example 


The symmetries of a rectangle 
4 3 
Ld. 
are easily seen to be 
(033 a) o-G 57 3) 


_{!| 2 3 4 _{! 3 4 
b=(0 7 a 3) er (3 i? 


Geometrically, a is a rotation through an angle z, and b and c are reflec- 
tions in the axes of symmetry parallel to the sides. It is easily verified that 
a* = b*=c* = e, ab=c, bc = a, ca = b. Thus, the group of symmetries 
of a rectangle is the Klein four-group. 


Wh fh N 


Problems 


1. Find all subgroups of S; and S,. 

2. Ifn= 3, prove that 2(S,) = (e}. [Hint: Lete#oa€ S,. Suppose 
oi)=)j, i# 7. Let k # i,j. Choose rE S, such that t(é) = 1, 
t(j) =k, t(k) =/. Then (otXi) # (tai), so ot # ta.) 

3. Show that|Z(D,)| = 1 or 2 according to whether n is odd or even. 

4. Let a and b be elements of order 2 in a group G. Suppose 
o(ab) = 4. Show that the subgroup generated by a and b is D,. 
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6 Generators and relations 


Sometimes it is convenient to define a group by a subset _Y of the group 
and a set of equations satisfied by elements of X. For example, the group 
(Z/(n),+) is generated by | such that 71 = 0. The group S, is generated by 
{a,b} such that a3 = 1 = 62, b-'ab = a?. The dihedral group D, is gener- 
ated by {a,b} such that a” = | = 62, b='ab = a™'. A finite group, in gen- 
eral, may also be described by listing its elements and by writing down the 
multiplication table; but clearly this isan impossible act for groups of large 
orders (without computers!). Hence, we are naturally interested in a min- 
imal subset X of the group that generates it, and also in the set of equations 
satisfied by elements in the generating set that would suffice to give us all 
products of elements in the group. 


Definition. Let G be a group generated by a subset X of G. A set of 
equations (r, = |)je,q that suffice to construct the multiplication table of G 
is called a set of defining relations for the group (r, are products of elements 
of X). 

The set X is called a set of generators. The system (X;(r, = l)jeq) is 
called a presentation of the group. 
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| Normal subgroups and quotient groups 


We recall that multiplication in a group G induces a product of any two 
subsets A and B of G, given by AB = (xy|x € A, ye B). If A or Bisa 
singleton, we write aB for (a)B and Ab for A{b). Since multiplication in G 
is associative, the induced multiplication of subsets of G is also associa- 
tive. 


Definition. Let G be a group. A subgroup N of G is called a normal 
subgroup of G, written NAG, if xNx~' C N for every x € G. 


Trivially, the subgroups (e) and G itselfare normal subgroups of G. If G 
is abelian, then every subgroup of G is a normal subgroup. But the con- 
verse is not true: A group in which every subgroup is normal is not 
necessarily abelian [Example 1.5(e)]. It is easily proved that the center ofa 
group G is a normal subgroup of G, and if @: G — H isa homomorphism 
of groups, then Ker @ 4G. 

The following theorem gives several characterizations of a normal 
subgroup. 


1.1 Theorem. Let N be a subgroup of a group G. Then the following 
are equivalent. 
(i) NAG. 
(ii) xNx7~'=N for every x€ G. 
(iii) xN= Nx for every x EG. 
(iv) (xN\ YN) = xyN for all x,y € G. 
91 
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Proof. (i) = (ii) Suppose NIG. Let xEG. Then, by definition, 
XNx-'CN, Also, x~'€G. Hence, x~'NxC N. Therefore, N= 
X(x~'"Nx)x-!'! C xNx—!. Hence, xNx7! = N. 

(it) => (111) Nx = (xNx7!)x = xNx7'x = xNe = XN. 

(i11) => (iv) (XN) yN) = x(Ny)N = x(yN)N = xy(NN). Now NNC N 
because N is closed under multiplication. On the other hand, VN = eN C 
NN. Hence, NN = N. Therefore, (xN\ yN) = (xy)N. 

(iv) => (1) xNx7!' = xNx7'eC xNx—'N= xx7'N=eN=N. Hence, 
NAG. O 


Let N be a normal subgroup of G. The above theorem has shown that 
any left coset of N in G is a right coset and vice versa. Hence, we need not 
distinguish between left cosets and right cosets of N. We shall write 
all cosets of N as left cosets and denote the set of all cosets of N in G 
by G/N. 


1.2 Theorem. Let N be a normal subgroup of the group G. Then G/N 
is a group under multiplication. The mapping 6: G — G/N, given by 
x» XN, is a surjective homomorphism, and Ker @ = N. 


Proof. By Theorem 1.1, (xN)\yN) = (xy)N for all x,y € G. Hence, G/N is 
closed under multiplication. Because multiplication is associative in G, 
multiplication is also associative in G/N. The coset eN = Nis the identity 
for multiplication in G/N, and for any x € G, (xN)\(x7'N) = (xx7")N= 
eN. This proves that G/N is a group. 

The mapping @ is obviously surjective. For all x,y € G, 


f(xy) = (xy)N = (XN YN) = G(x)P(y). 
Hence, @ is a homomorphism. Further, xN = eN if and only if x € N. 
Hence, Ker @ = (x € Gid{x) = eN)=N,. O 


Definition. Let N be a normal subgroup of G. The group G/N is called the 
quotient group of G by N. The homomorphism G — G/N, given by x > 
XN, is called the natural (or canonical) homomorphism of G onto G/N. 


Definition. Let G be a group, and let S be a nonempty subset of G. The 
normalizer of S in G is the set 

N(S) = (x € G|xSx7! = S$}. 
The normalizer of a singleton (a) is written N(a). 


13 Theorem. Let G be a group. For any nonempty subset S of G, NS) 
is a subgroup of G. Further, for any subgroup H of G, 


Normal subgroups and quotient groups 93 


(i) N(#) is the largest subgroup of G in which H is normal: 
(ii) if K is a subgroup of NH), then H is anormal subgroup of KH. 


Proof. Clearly, e € MS). lf x, yE NMS), then 
(xm y)S(xm Fyy7 = x Cp Sy7! xe = x FS = x" Sx! x = S. 


Hence x~'y € MS). Therefore, MS) is a subgroup of G. Let H be a 
subgroup of G. Then AHh-' = H for all h € H. Therefore, H is a subset 
and, hence, a subgroup of M(#). Further, by definition, xHx—' = H forall 
x € MA). Hence, Hd M(H), Let K be any subgroup of G such that 
H<VK. Then kHk7'! = H for all k € K. Hence, K C NH). This proves 
that M(/7) is the largest subgroup of G containing H as a normal subgroup. 
Let K be a subgroup of MH). Then for all k € K, kHk~' = H. Hence, 
kH = Hk. Therefore KH = HK. Hence, by Theorem 3.5 of Chapter 4, 
KH is a subgroup of N(#/), and H C KH. Consequently, Hd HK. O 


Definition. Let G be a group. For any a,b€ G, aha™'b™' is called a 
commutator in G. The subgroup of G generated by the set of all commuta- 
tors in G is called the commutator subgroup of G (or the derived group of 
G) and denoted by G’. 


1.4 Theorem. Let G be a group, and let G' be the derived group of G. 
Then 


(i) G’'AG. 
(tt) G/G' ts abelian. 
(iit) If H<AG, then G/H is abelian if and only if G’ C H. 


Proof. (i) Let x = aba~'b~' be any commutator in G. Then x7! = 
bab-'a-' is also a commutator. Moreover, for any g in G, 


exe™' = (gag 'Xgbe~'Kga-'g~'Keb-'g-') 
= (gag™'Kebg~'Kgag™') '(gbg')' € G’. 


Now any element yin G’ is a product ofa finite number of commutators, 
Say 


Y= XXq...Xq5 
where X,,...,X, are commutators. Then for any g€ G, 
eye”! = (9x, 87 'ex.87') ++: (ex,87') EG’. 


Hence, G’ is a normal subgroup of G. 
(it) For all a, b € G, 


(aG’bG’aG’) '(bG’)"' = (aba~'b-')G’ = G’. 
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Hence, (aG’)(bG’) = (bG’\(aG’). Therefore, G/G’ 1s abelian. 
(111) Suppose G/H is abelian. Then for all a,b € G, 


(aba~'b-')H = (aH (bH KaH) (bH)"'! 
= (aH XaH) (bH\XbH)' = H. 


Hence, aba~'b—' € H. This proves that G’ C H. The converse is proved 

similarly. O 

1.5 Examples 

(a) If A< Gand BAG, then AN BAA and AB<G. 

Solution. Obviously AN B < A. To prove its normality, let a€ A, x € 

AN B. Then axa~' € B, because B <I G. Trivially, axa~' € A. Thus, 
VaeEA, WxE ANB, axa *EANBPANBAA. 


To prove AB<G, let a, a,E A, b, bE B. Then ab(a,b,)-'= 
abb;'aj'!€ AB, because (bby')ay'=ajz'b, for some b,€ B. Thus, 
AB <G. 


(b) If H, <4 G, i= 1,2,...,k, then H, H,...H, < G. 
Solution. Follows by induction on k. 


(c) If G is a group and H is a subgroup of index 2 in G, then H isa 
normal subgroup of G. 


Solution. lf a € H, then, by hypothesis, G = H U aH and aHN H= ©. 
Also, G= HU Ha, Han H =©. Thus, aH = Ha, a¢ H. But clearly 
aH = Ha, Wa € H. Hence, Wg € G, gH = Hg, proving that # is normal 
in G. 


(d) If Nand M are normal subgroups of G such that NN M = {e}, then 
nm=mn,WneE N,VWme M. 
Solution. lf n& N, mE M, then 
n-'m—' nm = (n—'m—'n)m E MM = M, 
and also 
n-'m~'nm = n-"(m—'nm) € NN = N. 
Thus, 
n-'m—'nm €E NN M = {e}. 
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Hence, nm = mn. 


(e) Give an example of a nonabelian group each of whose subgroups is 
normal. 


Solution. Let G be the quaternion group (see Problem 6, Section |, Chap- 
ter 4). The only subgroup of order 2, that is, 


(0. %).(-3 -)}. 


is trivially normal. All the subgroups of order 4 have index 2 and, hence, 
are normal by Example (c). Hence, G is a desired group. 


(f) Give an example of a group G having subgroups K and T such that 
K <4 7G, but K is not a normal subgroup of G. 


Solution. Let G be the octic group D, (see 5.2 in Chapter 4). Choose 
T = (e,07,t,0°t) and K = {e,t}. Clearly, from the defining relations of the 
octic group, 7 and K are subgroups. Since the index of T is 2, T <1 G by 
Example (c). Also K as a subgroup of 7 has index 2, and thus K <I T. 
However, K is not normal in G, because if we choose o € G and t € K, 
then o—'to € K. 


(g) Let G be a finite group with a normal subgroup N such that 
(|N|, |G/N]) = 1. Then N 1s the unique subgroup of G of order |}. 
Solution. Let K < G such that |K] =|N]. Then KN/N < G/N. Now 


KN| _|KM__IK1 
IN] {KOM 


N 


So by Lagrange’s theorem |K|/|K N N| divides |G/N|. But since |K| =|N| 
and (|N|, |G/N]) = 1, this implies|K|/|K Q N| = 1. Hence, K = KN N,so 
K=N. 


(h) If G is a group with center Z(G), and if G/Z(G) is cyclic, then G 
must be abelian. 


Solution. Let G/Z(G) be generated by xZ(G), x € G. Let a,b € G. Then 
aZ(G), as an element in G/Z(G), must be of the form x”Z(G) for some 
integer m; that is, 


aZ(G) = x"Z(G). 
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Thus, a= xy for some y € Z(G). Similarly, b = x"z for some z € 
Z(G) and some integer n, so that 


And 


ab = (x™y\x"z) = x™yx"z = x™x"yz 
x= ymtayz since y€ Z(G). 


ba = (x"z\x™y) = x"zx™yp = x"tmzy, since z € Z(G). 


Hence ab = ba, Va,b € G. 


Problems 


I. 
2. 


3. 


13. 


Prove that the center Z(G) of a group G is a normal subgroup. 

If Nis anormal subgroup ofa group G, and H is any subgroup of 
G, prove that NH is a subgroup of G. 

If Nand M are normal subgroups of G, prove that NM is also a 
normal subgroup of G. 

If there exists a unique subgroup H of order 10 (or 20) in a 
group G, show that H (JG. Generalize this result to subgroups 
of other orders. (Hint: See Problem 3, Section 3.) 

Show that the cyclic subgroup of S, generated by(; ? 3)is 
not normal in S;. 

If NV is a normal subgroup of a group G such that NN G’ = {e), 
prove that NC Z(G). 

Let H be a subgroup of a group G such that x? € H for every 
x € G. Prove that H 1G and G/H 1s abelian. 

Find a subgroup N of order |2 in the symmetric group S, on four 
elements. Find the elements of S,/N and show that they can be 
made into a group. 

Prove that in S, the set N= {e, (1 2 3), (1 3 2)} is a normal 
subgroup. Find the elements of S,/N. 

Let H be a subgroup of G. Prove that H <1G if and only if the 
product of two left cosets of H in G is again a left coset. 

Show that the set SL (n,R) of invertible n x n matrices over 
R having determinant | forms a normal subgroup of the linear 
group GL (n,R). 

Let G/N be the quotient group of G. Suppose o(gN) is finite. 
Show that o(gN) divides o(g). Also show that g”eN if and 
only if o(gN) divides m. 

Show that the order of each element of the quotient group 
Q/Z is finite. 


16. 


17. 


18. 


19. 


2 
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Let G be a group such that for some integer m> I, 

(ab)" =a™:b", forall a,beG, 

and let 

G" = {a™\|aeG}. 

Show that 

G" 1G 

and the order of each element of G/G” is finite. 

Show that there does not exist any group G such that |G/Z(G)| 
ae ead compute the first and second derived groups, 


G=G' and G? =(GY. 
If A, B are subgroups of a group G such that 


B<IG, ANB={e}, G=AB, 


then G is called a semidirect product (or split extension) of A and 
B. Show that the dihedral group D, can be expressed as a split 
extension of two suitable proper subgroups. 

Show that the quaternion group (see Problem 4, Section |, 
Chapter 4) cannot be expressed as a split extension of two 
nontrivial subgroups. 

Let L(G) be the set of normal subgroups of a group G. Show 
that L(G) is a modular lattice. Show also that the lattice of all 
subgroups need not be modular. [A lattice Lis called modular 
if for all a,b,ceL, witha<c,av(bac)=(av b)aAc.] 
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In this section we prove some important theorems on group homotnor- 
phisms known as isomorphism theorems. We saw in Theorem 1.2 that 
every quotient group G/N ofa group G isa homomorphic image of G. The 
first isomorphism theorem (which is also known as the fundamental 
theorem of homomorphisms) proves the converse; that is, every homo- 
morphic image of G is isomorphic to a quotient group of G. 


2.1 


Theorem (first isomorphism theorem). Let ¢: G—>G’ be a 


homomorphism of groups. Then 


G/Ker @ = Im @. 
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Hence, in particular, if @ is surjective, then 
G/Ker ¢ = G’. 


Proof. Consider the mapping 

y:G/K—~Imq@__ givenby xK-> d(x), 
where K = Ker @. For any x,y € G, 

xK = yK = y"'x € K = @(y7'x) = e’ & G(x) = Ay). 
Hence, y is well defined and injective. Further, 

W(xK (yK)) = WxyK) = b(xy) = b(x)P(y). 


Hence, y is a homomorphism. Since y is obviously surjective, we con- 
clude that y is an isomorphism of groups. O 


2.2 Corollary. Any homomorphism @: G — G’' of groups can be fac- 
tored as 


h=j yn, 
where n: G— G/Ker @ is the natural homomorphism, y: G/Ker 6 > 


Im @ is the isomorphism obtained in the theorem, and j:1m @ — G’ is the 
inclusion map. 


¢ 
(ie alll ¢, 


i; | 


C/Ke ¢—_—_———_ Im ¢ 
Proof. Clear. O 


2.3 Theorem (second isomorphism theorem). Let H and N be sub- 
groups of G, and N<\G. Then 


H/HO N= HNN. 
The inclusion diagram shown below is helpful in visualizing the 


theorem. Because of this, the theorem is also known as the “diamond 
isomorphism theorem.” 
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Proof. Since N <1G, HN = NH isasubgroup of G and N <4 HN. Consider 
the mapping 


@: H— HN/N — givenby Ah AN forall hE H. 


In fact, @ is the restriction of the natural homomorphism p: G — G/N to 
H. Hence, Ker @ = HNN. Moreover, ¢ is clearly surjective. Hence, by 
the first isomorphism theorem, H//1N N= HN/N. O 


2.4 Theorem (third isomorphism theorem). Let H and K be normal 
subgroups of G and KC H. Then 


(G/K)/(H/K) = G/H. 
This theorem is also known as the “‘double quotient isomorphism 
theorem.” 
Proof. Consider the mapping 
@:G/K—G/H — given by xK —> xH. 
The mapping is well defined, for 
xK = yK>x"yeEK>x 'y6€ H=> xH= yH. 
Further, for all x,y € G, 
G((xK)yK)) = O(xyK) = xyH = (xH)(yH). 
Hence, ¢ is a homomorphism. Now ¢ is obviously surjective, and 
Ker @ = (xK|xH = H) = (xK|x © H} = H/K. 
Hence, by the first isomorphism theorem, (G/K)/(H/K) = G/H. 0 


The next theorem shows that the direct product of two quotient groups 
is isomorphic to a quotient group. 
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2.5 Theorem. Let G, and G, be groups, and N, 1G,, N, 1G,. Then 
(G, X G,)/(N, X N2) = (G,/N,) X (G,/N,). 


Proof. Consider 

p: G, X G, — (G,/N,) X (G,/N2) 
given by 

(XX) > (XN, ,X2N,). 


It is easily verified that @ is a surjective homomorphism and Ker 
od =N,XN,. Hence, the result follows by the first isomorphism 
theorem. O 


We recall that if o: S—» Tis a mapping from a set S into a set 7, and if 
X C T, then o~'(X) denotes the set {s € S|o(s) € X }. 0 '(X’) is known as 
the inverse image of X under o. We may emphasize that a~' is a mapping 
not from T to S but rather from (7) to A(S). However, if a is bijection 
then o~' defined as above coincides with the usual inverse. 


2.6 Theorem (correspondence theorem). Let @: G — G’ be a homo- 
morphism of a group G onto a group G’. Then the following are true: 


i) H<G>qQA)<G’. 

(i)! HW <C> A’) <G. 

(ii) HAG=>qQA) AGC’. 

(ii)! H' 1G’ >¢°- (A) AG. 

(itt) H<GandHDKerd>H=¢ (P(A). 

(iv) The mapping H~> d(H) is a 1-1 correspondence between the 
family of subgroups of G containing Ker @ and the family of 
subgroups of G’; furthermore, normal subgroups of G correspond 
to normal subgroups of G’. 


Proof. (i) Let a,b © H; so d{a),d(b) © O(H). Then 
(aXp(b))-' = d(a)h(b-') = d(ab-') € H(A), 


since ab-' € H. Thus, d(H) < G’. 
(i)’ Let a,b € @-"(A’). Then d(a),d(b) © H’. Now 


(ab!) = d(aXg(b))~! € A’; 
so ab-' € @-"(H’). Thus, 6~'(H’) < G. 
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(ii) Let O(h) € @(H) and g‘ € G’. Then g’ = G(g) forsome g € G. Now 
a’ '(b(h))g’ = (b(g))" * + bth): b(g) = Ole” ‘hee d(H), 


since H <G. Therefore, d(H) 4G’. 
(ii)’ Let hh € -'(H’), g © G. Then @(h) € H’. Now 


b(g” 'hg) =(¢(g))"' + o(h) O(g)EH', 


since H’ 1G’. Therefore, g~'hg € @~ '(H’). Hence, @~'(H’) 4G. 
(iii) Trivially, H C @~'(q(H)). Let x € &- '(h(77)). Then 


d(x) € d(H) => d(x) = d(h) (for some  € H) 
= d(xh—') = d(e) => xh“! € Ker 
=xh"-'E€ H (as HD Ker ¢) 
=> xEH. 


Thus, H = @-'(f(H)). 

(iv) Let H’ < G’. Then by (i)’ 6 '(H7’) is a subgroup of G containing 
Ker @, so by (iii), d(@- '(H’)) = H’. So the mapping H — q{H) is surjec- 
tive. To show that the mapping is injective, let ¢(H,) = ¢(H,), where 
H,,H, are subgroups of G containing Ker @. Then 7 '(d(H,)) = 
d~ '(d(H,)), so, by (iii), H, = H,. The last part of (iv) follows from (ii). © 


Remark. If 6: G — G’ isany homomorphism, then Theorem 2.6 remains 
true if G’ is replaced by Im @. 


2.7 Corollary. Let N be a normal subgroup of G. Given any subgroup 
H’ of G/N, there is a unique subgroup H ofG such that H’ = H/N. Further, 
HG ifand only if H/NAG/N. 
Proof. Consider the natural homomorphism 
@:G—G/N _ givenby x XN. 

By Theorem 2.6 there is a unique subgroup H of G containing N such that 
H’=Q(H)=H/N. O 
Definition. Let G be a group. A normal subgroup N of G is called a 
maximal normal subgroup if 

(i) N#¥G. 

(ii) HAGandHDN=>H=NorH=G. 


Definition. A group G is said to be simple if G has no proper normal 
subgroups, that is, G has no normal subgroups except {e) and G. 
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The following result is an immediate consequence of Corollary 2.7. 


2.8 Corollary. Let N be a proper normal subgroup of G. Then N is a 
maximal normal subgroup of G if and only if G/N is simple. 


2.9 Corollary. Let H and K be distinct maximal normal subgroups of 
G. Then HN K is a maximal normal subgroup of H and also of K. 


Proof. By the diamond isomorphism theorem 
H/HO K = HK/K. 


Now K < HK 1G. Hence, HK = K or HK = G, since K 1s maximal. But 
HK = K => HC K, acontradiction, since H and K are both maximal and 
distinct. Hence, HK = G. Therefore, 


H/H 1 K = G/K. 


Hence, by Corollary 2.8, HM K is a maximal normal subgroup of H and, 
similarly, of K. © 


2.10 Examples 


(a) Let G be a group such that for some fixed integer > 1,(ab)" =a"b" for 
all a,b € G. Let G, = (a € Gla” = e) and G" = (a"|a € G). Then 


G,<G, G"<G, and G/G,=G". 


Solution. Let abE€G, and xEG. Then (ab“'y = a%b")-' =e, so 
ab-'EG,,.. Also, (xax~')" = xa"x~' =e implies xax~' € G,. Hence, 
G, AG. Similarly, G? 4G. Define a mapping f: G — G" by f(a) = a". 
Then, for all a,b € G, f(ab) = (ab)" = a"b". Thus, fis a homomorphism. 
Now Ker f= ({ala"=e)}=G,. Therefore, by the first isomorphism 
theorem G/G, = G". 


(b) Let G be a finite group, and let 7 be an automorphism of G with the 
property 7(x) = xifand only if x = e. Then every g € Gcan be expressed 
asg= x~'T(x) for some x € G. 


Solution. We assert that x~'!7(x) = y~'T(y) if and only if x = y. 


X~'T(x) = y'T(y) & (yx!) = TIY(TO))' 
= yx7! = T( yx!) yx“! = ee y=x, 
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by hypothesis. Therefore 
G = {x7'T(x)|x € G). 


(c) If in Example (b) we also have the condition that 72 = /, then G is 
abelian. 


Solution. Let x € G. Then 


xT (x)) = T(x7 (TX) = T(T(x> T(x) = T(T(x7")x) 
= T((x7'T(x))"'). 


Therefore, for all g € G, T(g~') = g. Nowifa,b € G, then T((ab)') = ab. 
On the other hand, 


T((ab)-') = T(b-'a™') = T(b—')T(a7') = ba. 


Therefore, ab = ba. 
(d) A nonabelian group of order 6 is isomorphic to S;. 


Solution. lf each element is of order 2, then G is abelian. So there must be 
an element a of order 3. Let b € G be such that b ¢ {e,a,a7). Then it is easy 
to check that e,a,a?,b,ab,a2b are all distinct elements and, thus, constitute 
the whole group G. Now b? # a or a’. For let b? = a. Then b° = e. This 
implies that the order of b = 2,3,6. But then the order of b = 2 implies 
a = e, acontradiction. And the order of b = 3, along with b? = a, implies 
ab = e, acontradiction. Also the order of b = 6 implies that G is cyclic, a 
contradiction to the fact that G is not abelian. Hence, b? ¥ a. Similarly, 
b? # a’. Also b? = b, ab, or a?b would imply b = e, a, or a2, which is not 
the case. Therefore the only possibility for b? € G is that b? = e. Further, 
the subgroup [a] = {e,a,a7) generated by a is of index 2 and, thus, normal. 


Therefore, bab! = e, a, or a?. But bab™' = e gives a = e, which is not 

so; and bab~' = a says that G is abelian, which is also not so. Thus, 

bab-' = a’. Hence, G is generated by a,b with the defining relations 
aa=e=b6?, bab™' =a’. 

On the other hand, S;, is also generated by a’ and b’, where 
a’3=e'=b’?, b’a’b’"'! =a’?, 

Then the mapping 


ee’, ara’, a*a’’, bb’, ab a’b’, aztb-a’?b’ 


is an isomorphism of G onto §;. 
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Problems 


1. Ifo isan isomorphism of a group G into a group H, prove that 
(o(a)) = e’ if and only if a” = e. Give an example to show this is 
not true if o is not an isomorphism. 

2. If a group G is generated by a subset S, prove that o(S) also 
generates G, where o is an automorphism of G. 

3. Show that a cyclic group of order 8 is homomorphic to a cyclic 
group of order 4. 

4. Write down all the homomorphic images of 
(i) the Klein four-group, 

(ii) the octic group. 

5. Show that each dihedral group is homomorphic to the group of 
order 2. 

6. Let[a] be acyclic group of order m and [5] acyclic group of order 
n. Show that there is a homomorphism a of [a] into [6] such that 
o(a) = b* if and only if mk is a multiple of 2. Further, if mk = qn, 
show that ¢ is an isomorphism if and only if (7,q) = 1. 


wn 


3 Automorphisms 


Recall that an automorphism ofa group G is an isomorphism of G onto G. 
The set of all automorphisms of G is denoted by Aut(G). We have seen 
(Example 2.1(d), Chapter 4) that every g € G determines an automor- 
phism J, of G (called an inner automorphism) given by x > gxg'. The 
set of all inner automorphisms of G is denoted by In(G). 


3.1 Theorem. The set Aut(G) of all automorphisms of a group G isa 
group under composition of mappings, and In(G) <i Aut(G). Moreover, 


G/Z(G) = In(G). 
Proof. Clearly, Aut(G) is nonempty. Let o,t € Aut(G). Then for all 
xyEG, 

at(xy) = o((txXty)) = (at(x)Kor(y)). 
Hence, ot € Aut(G). Again, 

a(a~"(x)o~'(y)) = aa7"(x)oa7 '(y) = xy. 


Hence o~'(x) o~"(y) = o~'(xy). Therefore, o~' € Aut(G). This proves 
that Aut(G) is a subgroup of the symmetric group S, and, hence, is itself a 
group. 
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Consider the mapping 
@:G—Aut(G) givenby a—>I/,. 
For any a,b € G, 
T(x) = abx(ab)"' = a(bxb™')a~! = 1, 1,{x) 


for all x € G. Hence, ¢ is a homomorphism, and, therefore, In(G) = 
Im @ is a subgroup of Aut(G). Further, J, is the identity automorphism if 
and only if axa~' = x for all x € G. Hence, Ker @ = Z(G), and by the 
fundamental theorem of homomorphisms 


G/Z(G) = In(G). 
Finally, for any o € Aut(G), 
(aI,a7'Xx) = a(ao~'(x)a~') = a(a)xo(a)! = Ta¢a(x); 
hence, ola! = I.) € In(G). Therefore, In(G) d Aut(G). O 


It follows from the theorem that if the center ofa group G is trivial, then 
G = In(G). A group G is said to be complete if Z(G) = (e} and every 
automorphism of G is an inner automorphism; that is, G = In(G) = 
Aut(G). 

When considering the possible automorphisms o of a group G, it is 
useful to remember that, for any x € G, x and o({x) must be of the same 
order (see Problem 19, Section 3, Chapter 4). 


3.2 Examples 


(a) The symmetric group S, has a trivial center (e}. Hence, In(.S,) = S;. 
Now consider all automorphisms of S§,. 
We have seen that 


S, = (e,a,a?,b,ab,a7b) 
with the defining relations 
@=e=b?, ba=a’b. 


The elements a and a? are of order 3, and b,ab, and a’b are all of order 2. 
Hence, for any o € Aut(S;), o(a) = a or a?, o(b) = b, ab, or a2b. More- 
over, when o(a) and o(b) are fixed, a(x) is known for every x € S,. Hence, 
a is completely determined. Thus, there cannot be more than six auto- 
morphisms of S,. Hence, Aut(S;) = In(S,) = S,. Therefore, S; is a com- 
plete group. 

(b) Let G be a finite abelian group of order 1, and let 7m be a positive 
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integer prime to n. Then the mapping o: x x” is an automorphism 
of G. 


Solution. (m,n) =1=>3 integers u and v such that mu+nv= 
11> VxEG, 


x = xmuton = D didi Cady = Daas 


since 0(G) = n. 

Now Vx € G, x = (x“)” implies that o is onto. Further, x" = e=> 
x = e=> x = e, showing that a is 1-1. That 0 is a homomorphism fol- 
lows from the fact that G is abelian. Hence, o is an automorphism of G. 


(c) A finite group G having more than two elements and with the 
condition that x? # e for some x € G must have a nontrivial automor- 
phism. (Also see Problem 1.) 


Solution. When G is abelian, then o: x > x~' is an automorphism, and, 
clearly, o 1s not an identity automorphism. When G is not abelian, there 
exists a nontrivial inner automorphism. 


(d) Let G = [a] bea finite cyclic group of order n. Then the mapping a: 
a*>a™ is an automorphism of G iff (m,n) = 1. 


Solution. lf (m,n) = 1, then it has been shown in Example (b) that o is an 
automorphism. So let us assume now that o is an automorphism. Then 
the order of o(a) = a™ is the same as that of a, which is n. Further, if 
(m,n) = d, then (a”)"/4 = (a")"/4 = e. Thus, the order of a” divides n/d, 
that is, njn/d. Hence, d = 1, and the solution is complete. 


(e) If G isa finite cyclic group of order 7, then the order of Aut(G), the 
group of automorphisms of G, is d(n), where @ is Euler’s function. 


Solution. Let G = [a] and 0 € Aut(G). Since o(a') = (o(a))' for each 
integer i, o is completely known if we know o(a). Now let o(a) = a”, 
msn. Then as in Example (d), we have (m,n) = 1. Thus, each o deter- 
mines a unique integer m less than and prime to ”, and conversely, which 
completes the solution. 


Problems 


1. Find the group of automorphisms of (Z, +). 
2. Find the group of automorphism of (Z,, + ). 
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3. Find Aut(K) where K is the Klein four-group. 
What about Aut(Z, x Z,)? 

4. Show that the group of automorphism of D, is of order 8. 

5. Show that Aut(Z, x Z,) consists of eight elements sending 
(1,0) to (1,0) or (1,2) and (0, 1) to (0, 1), (0, 3), (1, 1), or (1, 3). 

6. Show that Aut(Z, x Z,) = Aut(Z,) x Aut(Z,). 
In the following problems 7 and 8, you may invoke Theorem 
2.1, Chapter 8. 

7. Let A be a noncyclic finite abelian group. Prove that Aut(A) is 
not abelian. 

8. Let G be a finite group such that |Aut(G)| = p. Prove |G| < 3. 
(Hint: Note G is abelian) 


4 Conjugacy and G-sets 


Definition. Let G be a group and X a set. Then G is said to act on X if there 
is a mapping ©: G X X > X, with d{a,x) written a * x, such that for all 
abEG,xEX, 


(i) a*(b* x) =(ab) *x, 
(ii) e*x=x., 


The mapping @ is called the action of G on X, and X is said to be a G-set. 


One of the most important examples of G-sets is the action of the group 
G on itself by conjugation defined by 


a*x=axa™' 


{see Example 4.1(b)]. The reader interested in conjugation and its appli- 
cations only may go directly to Theorem 4.8 without breaking continuity. 

To be more precise, we have just defined an action on the left. Sim1- 
larly, an action of G on X on the right is defined to be a mapping 
XXG—X with (x,a)—> x*a such that x * (ab) = (x * a)* b and 
x*e=x forall a,b€ Gand xe X. 

We shall confine ourselves to groups acting on the left, and for the sake 
of convenience we sometimes write ax instead of a * x. 


4.1 Examples of G-sets 


(a) Let G be the additive group R, and X be the set of complex 
numbers z such that |z]=1. Then X is a G-set under the action 


y*c=e'%c, where yeR and ceX. 


Here the action of y is the rotation through an angle @ = y radians, 
anticlockwise. 
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(b) Let G=S, and X = {x,, x2, X3,X4,X5} be a set of beads forming 
a circular ring. Then X is a G-set under the action 


BX =X BESS. 


(c) Let G=D, and X be the vertices 1,2,3,4 of a square. X is a 
G-set under the action 


gri=e(i), gED,, i€ {1,2,3,4}. 
(d) Let G be a group. Define 
a*e*x=ax, a&G,xEG. 


Then, clearly, the set G is a G-set. This action of the group G on itself is 
called translation. 
(e) Let G be a group. Define 


a*x=axa™', a€G,xeEG. 
We show that G is a G-set. Let a,b € G. Then 


(ab) * x = (ab)x(ab)“' = a(bxb™')a™! 
=a(b* xja~'=a*(b* x). 


Also, e « x = x. This proves G is a G-set. 

This action of the group G on itself is called conjugation. 

(f) Let G be a group acting on a set X. The action of G on X can be 
extended to the power set P(X) as follows. Define 


a*S=(a*x|xE S} 


for all a€@ G and SCX. Then, clearly, (ab)* S= a*(b*S) and 
e*S=Sfora,be€ Gand SC X. Therefore #(X) is a G-set. 

(g) Let G bea group and H <G. Then the set G/H of left cosets can be 
made into a G-set by defining 


a* xH = axH, a€éG, xHE G/H. 


(h) Let G be a group and H <G. Then the set G/H of left cosets is a 
G-set if we define 


a* xH = axa™'H, a&G, xH E G/H. 
To see this, let a,b € G and xH € G/H. Then 

(ab) * xH = abxb“'a“'"H = a* bxb"'H=a*(b* xH). 
Also, e * xH = xH. Hence, G/H is a G-set. 


If X is a G-set, then, as stated earlier, we generally write ax instead of 
a * x for the sake of simplicity. 
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4.2 Theorem. Let G be a group and let X be a set. 
(i) IfX isaG-set, then the action of Gon X induces a homomorphism 
op: G —_ Sy ; 


(ii) Any homomorphism $:G — Sy induces an action of G onto X. 


Proof. (i) We define ¢:G—-S, by (¢(a))(x) = ax, aeG, xeEX. Clearly 
d(ajeS,, aeG. Let a,beG. Then 


(d(ab) x) = (ab)x = a(bx) = a((G(b)Xx)) = (f(a) (((5) KX) 
=(d(a)p(b)Xx) _—for all x © X. 


Hence, d(ab) = d(a)p(b). 
(ii) Define a*x = (¢(a))(x); that is, ax = (¢(a))(x). Then 


(ab)x = (d(ab) x) = (f(a)h(b) (x) = G(aXd(b\(x)) 
= d(aXbx) = a(bx). 


Also, ex = (f(e))(x) = x. Hence, X is a G-set. O 


Regarding G as a G-set as in Example 4.1(d), we obtain Cayley’s 
theorem (see Theorem 5.1, Chapter 4). 


4.3 Cayley’s theorem. Let G be a group. Then G is isomorphic into 
the symmetric group Sg. 


Proof. By Theorem 4.2 there exists a homomorphism @: G — Sg, where 
(d(a)\(x) = ax, a E G, x € G. Suppose G(a) = the identity element in Sg. 
Then for all x € G, (b(a))x) = x. This implies ax = x for all x € X, and 
hence a = e, the identity in G. Therefore, ¢ is injective. O 


Remark. An isomorphism of G into a group of permutations is called a 
Jaithful representation of G by a group of permutations. 


Further, if H < G the set G/H of left cosets of H in G is a G-set [Exam- 
ple 4.1(g)]j. The action of G on G/H gives us another representation of G 
by a group of permutations, which is not necessarily faithful. This is 
contained in 


4.4 Theorem. Let G be a group and H < G of finite index n. Then 
there is a homomorphism @: G — S,, such that Ker @ = NyegxHx~'. 
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Proof. Since |G/H| = n, Sg;4 = S,. By Theorem 4.2 there exists a homo- 
morphism @: G — S, such that d(a)(xH) = axH. Now 


Ker @ = (a € Gid(a) = identity permutation) 
= (a € GlaxH = xH, Vx EG) 
= (a € Gix~'ax € H, Vx EG) 
= (a& Gla € xHx™', Vx E G) 
=()xHx"'. Oo 


xEG 


Remark. We obtain the Cayley representation of G by taking H = {e). 


4.5 Corollary. Let G be a group with a normal subgroup H of index n. 
Then G/H is isomorphic into S,,. 


Proof. Follows from Theorem 4.4, since H 1G implies Ker@=H. O 


4.6 Corollary. Let G be a simple group with a subgroup # G of finite 
index n. Then G is isomorphic into S,,. 


Proof. Follows from Theorem 4.4 by recalling that Ker @<G. O 


Definition. Let G be a group acting ona set X, and let x € X. Then the set 
G, = (g © Glex = x), 
which can be easily shown to be a subgroup, is called the stabilizer (or 
isotropy) group of x in G. 
For example, if G acts on itself by conjugation [Example 4.1(e) ], then, 
for x E G, 
G, = {a € Glaxa' = x} = Mx), 


the normalizer of x in G. Thus, in this case the stabilizer of any element x 
in G is the normalizer of x in G. 

Another example is the G-set G/H, where H < G [Example 4.1(g)]. 
Here the stabilizer of a left coset xH is the subgroup 


{g © GlexH = xH) = (g€ Gix~'ex € H) 
= (gE Gle € xHx~') = xHx™"'. 
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Definition. Let G be a group acting ona set X, and let x © X. Then the set 
Gx = {axla € G) 
is called the orbit of x in G. 


In case G acts on itself as in Example 4.1(d), the orbit of xEG is Gx = 
{ax|aeG} =G. 

Further, considering G as a G-set as in Example 4.1(e), where the 
action of the group G is by conjugation, the orbit of xE G is Gx= 
{axa~'ja € G}, called the conjugate class of x and denoted by C(x). 

We now prove a very important result. 


4.7 Theorem. Let G be a group acting on a set X. Then the set of all 
orbits in X under G is a partition of X. For any x € X there is a bijection 
Gx — G/G,, and, hence, 


\Gx| = [(G:G,]. 
Therefore, if X is a finite set, 

X|= > [G:G,], 

xEC 

where C is a subset of X containing exactly one element from each orbit. 
Proof. For any x,y € X, let x ~ y mean that x = ay for some a € G. Now 
x = ex for all x © X; and if x = ay, then y = a~'x. If x = ay and y = bz, 
then x = (ab)z. Hence, ~ is an equivalence relation on_X, and the equiva- 


lence class of x € X is the orbit Gx. Hence, the set of all orbits is a partition 
of X. Therefore, 


X= LJ Gx (disjoint), 


xEC 


where C is any subset of X containing exactly one element from each 
orbit. 
Given x €_X, consider the mapping 


ob: Gx G/G,  givenby ax-aG, forallaeG. 
For any a,b€ G, 
ax = bx = a'bx=xea'bE G, = aG,, = bG,. 


Hence, ¢ is well defined and injective. Moreover, @ is obviously surjective 
and, therefore, bijective. Hence, |Gx|=|G/G,| = [G:G,]. 
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Since_X is the disjoint union of orbits Gx, it follows that if X is finite, 
then 


IX|= ¥ |Gxl= ¥ (G:G,]. 0 


xEC xEC 


The partition of X given by the orbits in X under G is called the orbit 
decomposition of X under G. 

In Theorem 4.8 we give a version of Theorem 4.7 for the case when G 
acts on itself by conjugation. For the reader interested in conjugation 
only, we give an independent proof of Theorem 4.8. 

Let a € G. Then we define 


C(a) = (xax™'"|x € G), 


which is called the conjugate class of a in G. 
Recall that the normalizer of a in G is the set 


Na) = (x € Glxax—' = a), 
which is a subgroup of G. 


4.8 Theorem. Let G be a group. Then the following are true: 


(i) The set of conjugate classes of G is a partition of G. 
(ii) |C(a)| =[G:N(a)]. 
(iii) If G is finite, |G|=<X[G:N(a)], a running over exactly one 
element from each conjugate class. 
Proof. (i) Define a relation ~ on G as follows: 
a~b if a=xbx™' for some x E G. 


Clearly, ~ is an equivalence relation on G. The equivalence class of ain G 
is then the set (xax~'|x € G}, which is also the conjugate class C(a) of a. 
This proves G = UC(a), a disjoint union of conjugate classes. 

(11) Let a € G. Then the mapping 


a: C(a)-> G/N(a) ~—s givenby = xax~'!+> xNa) 
is trivially onto. Also, it is 1-1. To see this, let x,y € G. Then 
XN(a) = yN(a) = y~'x € Na) = y~'xa = ay~'x = xax™'! = yay, 
Hence, g is a bijection, proving |C(a)| =|G/Na)|. 
(111) Follows from (i) and (ii). O 


The partition of G given by the conjugate classes in Theorem 4.8 (i) is 
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called the class decomposition of G. The equation in part (111) is called the 
class equation (or class equation formula) of G. 
Let G be a group, and let S be a subset of G. If x € G, then the set 


x'Sx = (x7'sx|s € S} 
is called a conjugate of S. 


Definition. Let S and T be two subsets of a group G. Then T is said to be 
conjugate to S if there exists x € G such that T = xSx™', 


Clearly, “being conjugate to” is an equivalence relation in the power 
set ?(G) of the set G. 

Following the method of proof of Theorem 4.8 or by specializing 
Theorem 4.7 to the G-set 9(G), where G acts on (G) by conjugation, we 
can easily prove 


4.9 Theorem. Let G be a group. Then for any subset S of G, 
IC(S)| = [G: N(S)} [MS) = (x € G|x7!Sx = S)}]. 


We now give some important applications of the class equation of a 
finite group G (Theorem 4.8). Let us first observe that x € Z(G) if and 
only if the conjugate class C(x) has just one element, namely x itself. 
Therefore G is the disjoint union of Z(G) and all conjugate classes having 
more than one element. Hence, the class equation can be expressed as 

IG)=|Z(G)i+ S [G:NOd)], 
xEC 
where C contains exactly one element from each conjugate class with 
more than one element. 


4.10 Theorem. Let G bea finite group order of p", where p is prime and 
n> 0. Then 


(i) Ghasa nontrivial center Z. 
(it) ZA Nis nontrivial for any nontrivial normal subgroup N of G. 
(iii) If H is a proper subgroup of G, then H is properly contained in 
NH); hence, if H is a subgroup of order p"~', then H AG. 


Proof. (i) Consider the class equation of G, 
IG|= p"=|Zi+ ¥ [G:NOo), (1) 


xEC 
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where Z = Z(G), and Cis a subset of G having exactly one element from 
each conjugate class not contained in Z. By Lagrange’s theorem |Mx)| 
divides p" for every x € G, and if x € Z, |M(x)|< p". Hence, p divides 
[G: N(x)] for every x € Z. Therefore, p divides 2,--(G: M(x)]. It follows 
now from the class equation (1) that pdivides|Z|. This proves that Z is not 
trivial. 

(ii) Because G = ZU(U__.. C(x) (disjoint), 


N=GNN=(ZNN)U (cw n v) (disjoint). 
xEC 


Hence, 
INJ=|ZNN+ > IC(X) ON NI. (2) 
xEC 


If x € N, then C(x) C N. Hence, for every x € C, C(x) NN is either 
empty or equal to C(x), and therefore |C(.x) N M{is either zero or equal to 
[G:Mx)]. Hence, p divides 2,<-|C(x)N N|. Because p divides |N], it 
follows from the equation (2) that p divides |ZM N|. This proves that 
ZN Mis not trivial. 

(i11) Let K be a maximal normal subgroup of G contained in H. Then 
the quotient group G/K is of order p’ for some r > 0. Hence, by the first 
part of the theorem, G/K has a nontrivial center, say L/K. Since L/K <J 
G/K, it follows by Theorem 2.6 that L <1 G. Now L is not contained in //, 
since K is a maximal normal subgroup of G contained in H, and K ¢ L. 

Let hE H, [E L. Because L/K is the center of G/K, (AK IK) = 
(IK)(hK). Hence, I~ 'hlehK C H. Therefore, L C N(H). This implies that 
H # N(#). Because H Cc N(/1), it follows that H is properly contained in 
NA). 

If H is a subgroup of order p"~', then M(H) must be of order p”. Hence, 
NH) = G and, therefore, HG. O 


4.11 Corollary. Every group of order p? (p prime) is abelian. 


Proof. Suppose G is a nonabelian group of order p?. By Theorem 4.10, G 
has a nontrivial center Z that must be of order p. Leta € G,a € Z. Then Z 
is a proper subset of M(a). Hence, Ma) must be of order p?. This implies 
N(a) = G. Hence, a € Z, acontradiction. G 


We close this section by proving Burnside’s theorem, which has appli- 
cations in combinatorics. For a fixed g € G, let X, = (x € X|gx = x). 


4.12 Theorem (Burnside). Let G be a finite group acting on a finite set 
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X. Then the number k of orbits in X under G is 


} 
k=— X,\. 


Proof. Let S = {((g,x) € G X X|gx =x). For any fixed g € G the number of 
ordered pairs (g,x) in Sis exactly |X,]. Similarly, for any fixed x € X,|G,|is 
the number of ordered pairs (g,x) in S. Hence, 


> IXA=1S1= Dd IG. 
geG xEX 
By Theorem 4.7, |Gx|= [G:G,] =|G{/|G,|. Hence, 


l l 
IGA=IGl > = =I4Gl ce 
2 2 |Gx | >» », |Gx| 
where C is a subset of X containing exactly one element from each orbit. 
Now |Gx| = |Gal| for every x € Ga. Hence, 


I 
»> iGxi 


x€ Ga 


Therefore, 2,<y|G,|=|G| + k, which proves the theorem. O 


4.13 Examples 


(a) Let G be a group containing an element of finite order n > | and 
exactly two conjugacy classes. Prove that |G| = 2. 


Solution. Let e# a€ G such that o(a) = n. Because there are only two 
conjugate classes, these must be {e} and C(a). Therefore, if e# b€ G, 
then b € C(a). Thus 0(b) = o(a) = n. Let mijn. Then there exists an ele- 
ment b such that b € [a], the cyclic group generated by a, and o(b) = m. It 
then follows that m =n. Hence, 7 is prime. We claim that a? = e. If 
a? # e, then a? © C(a). Therefore, a? = xax~' for some x € G. This 
implies a?’ = x‘ax~‘ for all i > 0. Choose j = n. Because x” = e, we obtain 
a?” = a. This yields 2” = | (mod n), a contradiction. Hence, a? = e; so for 
all g € G, g? =e. This implies G is abelian. But, because G has only two 
conjugate classes, |G| = 2. 


(b) Let H be a subgroup of a finite group G. Let A,B € ¥(G), the power 
set of G. Define A to be conjugate to B with respect to H if B = hAh™' for 
some A € H. Then 
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(i) Conjugacy defined in 9(G) is an equivalence relation. 
(ii) If C,,(A) is the equivalence class of A € A(G) (called the conju- 
gate class of A with respect to #7), then 


IC,4(A)| = [Hi 2 N(A)). 


Solution. Follows from Theorem 4.7 by taking X to be the power set P(G) 
of Gand H to be the group that acts on_X by conjugation. Then the orbit of 
A€ X is the conjugate class C,,(A), and the stabilizer is HM N(A). 
The direct argument is as follows. First, (i) is obvious. For the proof of 

(ii), consider the mapping 

a: C;,(A) ~ H/H N NA), 
where 

o(hAh-') = h(HN MA)), he H. 
a is clearly onto. a is also 1-1, for let h,,4, © H. Then 

h (HO NA)) = h(H 9 MA)) = hy'h, © HO MA) 

=> hy'h, A = Ahj'h, => h,Ahj! = h, Ahz'. 


Hence, a is bijective. 


The example which follows gives a beautiful illustration of the Burnside 
Theorem. 


(c) Find the number of different necklaces with p beads, p prime, 
where the beads can have any of n different colors. 


Solution. Let X be the set of all possible necklaces. Clearly |X| =n’. Let 


be a necklace. Let G = [a] be the cyclic group of order p. Then X is 
a G-set under the action 
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a'ex = 


where subscripts are modulo p. 

Clearly the action of any fixed element a'e€G on a given necklace 
yields the same necklace. (The beads are just being permuted cyclically.) 
So the number of orbits is the same as the number of different necklaces. 
We now compute X, for each geG. First 


X= {xeXleex=x}=X. 
So |X,| =n’. Let ge. Since G is cyclic of prime order, g generates 
G. Then 
X, = {xeX|gex =x} 
= {xeEX|g'#x = x, Vi} 
= {xeX|a*ex =x, VaeG}. 
Therefore, X, consists of all those necklaces which are unchanged by 
any permutation and these are precisely those which are of one color. 


Hence |X,| =n, for all geG, g #e. The Burnside Theorem then gives 
the number of different necklaces as 


, (p -- l times) | 
p nv +n+---+n - n? + n(p—1) |. 


Problems 
1. Let G be a group. Show that 
Z(G)= (JC(x), xeG. 


IC(x)i= 1 
Find the number of conjugates of the element (1 3) in Dg. 
Determine the conjugate classes of the symmetric group of degree 
3 and verify that the number of elements in each conjugate class is 
a divisor of the order of the group. 
4. Let H be a proper subgroup of a finite group G. Show that G 
contains at least one element that is not in any conjugate of /7. 
(Hint: |C(H)|=[G:N(H)] and |x" 'Hx|=|A|.J 


w 
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10. 


11, 


12. 


14. 
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Let H <G, and let the index of H in G be finite. Prove that H 
contains a normal subgroup N that is of finite index. 

Let G be a group in which each proper subgroup is contained ina 
maximal subgroup of finite index in G. If every two maximal 
subgroups of G are conjugate in G, prove that G is cyclic. 

Let G be a finite group with a normal subgroup N such that 
(|N|,|G/N]) = 1. Show that every element of order dividing || is 
contained in N. 

Prove that in any group the subset of all elements that have only a 
finite number of conjugates is a subgroup. 

If the commutator group G’ ofa group Gis of order m, prove that 
each element in G has at most m conjugates. 

Let G be the group of symmetries of a cube. Prove that each of 
the following sets is a G-set: 


(i) Set of vertices. 

(ii) Set of long diagonals. 
(iii) Set of faces. 
(iv) Set of edges. 


Prove that if a finite group G has a normal subgroup N with 
|N| = 3 such that N ¢ Z(G), then G has a subgroup K such that 
[G:K] =2. [Hint: G acts by conjugation on N.] 

Let G be a group and X be a G-set. We say G acts transitively 
on X if for all x,, x,eX there exists g¢G such that g*x, = x). 
Show G acts transitively on X if and only if there is only one 
orbit. Also show if H < G and X = G/H, the set of left cosets of 
H, then G acts on X transitively. 

Let G =S, and X be the undirected graph with vertices 1,2,3 
consisting of 


7 a 2 ri 
| 3 1- 3 |——3 i——3 
I. \ PX? 
! 3. -—3. 1—3 FO 3. 


Show G acts on X canonically. Use the Burnside Theorem to 
show that the number of orbits is four. 
Let X = {A,,A,} be the set consisting of diagonals of a square 


15. 
16. 


17. 


18. 
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with vertices 1,2,3, and 4. Regard X as a D,-set under the action 
g*{a,a+2} = {g(a), g(a + 2)}, where {a, a + 2} represents a dia- 
gonal with vertices a,a + 2. Show that there exists a homomor- 
phism ¢: GS, with Kerg = {1,(2 4),(1  3),(1.— 3), (2, 4) }. 
Find the number of necklaces with six beads and two colors. 
Show that there are six different necklaces with four beads and 
two colors and eight different ones with five beads and two 
colors. 

Show that there are 5[k"+ k"'!"21) neckties having n strips 
(of equal width) of k distinct colors. 

Suppose X is a G-set. Show that X x X is also a G-set under the 
action g*(x,,x,)=(g*x,,g*Xx,). Consider the Klein four- 
group K as the group of symmetries of a rectangle with vertices 
1, 2,3, and 4, and let X = {1,2,3,4}. Regarding each of the sets 
X and X x X as a K-set in a canonical way, find the orbits 
of 1e€X and (1,2)eX x X. 

Consider additive and multiplicative groups (Z7, +) and (Z%,:). 
For convenience write Z, = {0,1,2,3,4,5,6}, and let G= 


(0 +) 


and N= 1(, 4 beZy} G. Regard X = 1(%) x, ved} 
y 


and X x X as G-sets in a natural way (as ordinary matrix multi- 


plication). Find the orbit of ((;): (5) Jex x X, 


beZ,,a=1,2, or 4} Show G is a group of order 21 


CHAPTER 6 


Normal series 


| Normal series 
Definition. A sequence (Gp ,G,,....G,) of subgroups ofa group G is called a 
normal series (or subnormal series) of G if 
{e} = Gg IG, IG, <4---dG,_,dG,=G. 
The factors of a normal series are the quotient groups G,/G,_,, 1 sisr. 
Definition. A composition series of a group G is a normal series (Gp,...,G,) 
without repetition whose factors G,/G,_, are all simple groups. The factors 
G,/G,., are called composition factors of G. 
We often refer to a normal series (Go, G,,...,G,) by saying that 
{e} =G,)CG, C --: CG,=G 


is a normal series of G. 
For any group G, {e} = GoC G, = Gis trivially a normal series of G. If 
G is a simple group, then (e} C G is the only composition series of G. 


1.1 Lemma. Every finite group has a composition series. 


Proof. \f |G| = 1 or if G is simple, then the result is trivial (if|G| = 1, then 
(G) is a composition series of G without factors). Suppose G is not simple, 
and the result holds for all groups of order < |Gj. Let H be a maximal 
normal subgroup of G. By the induction hypothesis H has a composition 


t20 
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series {e}C H,C°:-CH. Therefore, G has a composition series 
{fe}CH,C*CHCG. DO 


1.2 Examples 


(a) {e} C {e,e’} C {e,e’,1,1’) C Gis a composition senes for the quaternion 
group G, where e’ = —e and 
[ot ler 
A 1 0 -v=T 
(b) ({e} C (e,(1 2 3), (1 3 2)} C S; is a composition series for S3. 
(c) {e} C (e,07} C (e,0,07,0°} C D, is a composition series for the octic 
group D, (see Section 5, Chapter 4). 
(d) {0} C {0,9} C (0,3,6,9, 12,15) C {0,1,2,...,17}) = Z/(18) is a compo- 
sition for Z/(18). 


We remark that a composition series is not necessarily unique. For 
example, another composition series for the octic group D, is 


(e) C {e,t) C (e,07,1,02t} C Dy. 
However, it is shown in the next section that any two composition 
series of the same group are “equivalent,” in the following sense. 


Definition. Two normal series S = (Go,G,,...,.G,) and S’ = (G6,G},....G7) 
of G are said to be equivalent, written S ~ S’, ifthe factors of one series are 
isomorphic to the factors of the other after some permutation, that is, 


GiGi) ™ GoiylGowy-1, = Mees 


for some oe S,. 


Evidently, ~ is an equivalence relation. 


13 Theorem (Jordan - Hélder). Any two composition series ofa finite 
group are equivalent. 


Proof. Let G be a finite group and suppose the theorem holds for every 
group of order < |G|. Consider any two composition series of G, say 


S;: {e} =G)C G, C°::CG,=G, 

S; {e}=H,CH,C:::CH,=G. 
IfG,_, = H,_,, then S, ~ S, by the induction hypothesis. IfG,_, #H,_,, 
let K = G,_,N H,_,. By Corollary 2.9, Chapter 5, Kis a maximal normal 
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subgroup of G, _, and also of //,_,. Now K has a composition series, say 
{e} = KyC K, C-*:C K. This gives two more composition series of G, 
namely, 


S3: {e) = KyC K,C°::CKCG,,CG, 
Sy {e)}=KyCK,C*:-CKCH,_, CG. 


Now G,_,//,., = G. Hence, by the diamond isomorphism theorem, 
G/G,_, = H,_,/K and G/H,_, = G,_,/K. 


Therefore, S,; ~ S,. Further, by the induction hypothesis, S, ~ S, and 
S, ~S,. This proves that S, and S, are equivalent. O 


Theorem 1!.3 shows that the factors of any composition series of a finite 
group G - that is, the composition factors - are determined uniquely up 
to isomorphism and ordering. 


1.4 Examples 


(a) An abelian group G has a composition series if and only if G is finite. 


Solution. Let G be an abelian group with a composition series 
{e)} = GyC G, C:::C G,=G. Then the quotient groups G,/G,_,, being 
abelian and simple, must be cyclic groups of prime order p, (say), 
i= |,...,r. Hence, 


|G| = []!G./G,-1 =P °° D,- 
i=] 
Therefore, an abelian group G has a composition series if and only if G is 
finite. Moreover the composition factors of G are determined by the 
prime factors of |G]. 


(b) If a cyclic group has exactly one composition series, then it 1s a 
p-group. 


Solution. Suppose G = [a] isacyclic group of order p, «+: p,, where p,...,p, 
are primes (not necessarily distinct). By Theorem 4.4 in Chapter 4, G 
has a unique subgroup G, of order p,-°*:p,, namely, G;= [a ™], 
i= l,....7— 1. Hence, {e) = Go C G, C**-C G,_, C G,=G Is a unique 
composition series of G such that |G;/G,_,|= p,, i = 1,...,7. Thus, every 
permutation of the prime factors of |G| determines a unique composition 
series of G. Hence, G has a unique composition series if and only if 
p, =**: = p,; that is, |G| = p’. 
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(c) The Jordan - Holder theorem implies the fundamental theorem of 
arithmetic. 


Solution. Let G be a cyclic group of order 1. Suppose 7 has two factoriza- 
tions into positive primes, say 


n=p,""'p, and n=4""" Qs: 


Then, as shown in Example (b), G has a composition series whose 
factors are cyclic groups of orders p,,...,p,, respectively, and also a com- 
position series whose factors are cyclic groups of orders g,,....9,, TeSpec- 
tively. By the Jordan — Hdlder theorem the two series must be equivalent. 
Thus, r= s and p, = q, (after reordering if necessary). 


(d) Let G be a group of order p", p prime. Then G has a composition 
series such that all its composition factors are of order p. 


Solution. Clearly, any composition factor is of order p* for some k > 0. 
But since a group of prime power order has a nontrivial center, and each 
composition factor is a simple group, it follows that each composition 
factor is a simple abelian group and, therefore, of order p. 


Problems 


Write down all the composition series for the quaternion group. 

Write down all the composition series for the octic group. 

Write down a composition series for the Klein four-group. 

Find the composition factors of the additive group of integers 

modulo 8. 

5. Find all composition series for Z/(30). Verify that they are equiv- 
alent. 

6. If Gisa cyclic group such that |G|= p, «°- p,, p, distinct primes, 
show that the number of distinct composition series of G is r!. 

7. Let G bea finite group and N <JG. Show that G has a composi- 
tion series in which N appears as a term. 

8. Give an example of two nonisomorphic finite groups G, H 
which have isomorphic composition factors. 

9. Let us define a composition series of any group G to be a 

descending chain (possibly infinite) 


G,=GDG, 2G,2---D {e} 


such that G,;<G,_,, i= 1,2,3,..., and G,_ ,/G; is simple. 
Show that Z has no finite composition series but does possess 


f&wnn 
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infinite composition series. Write two such series and show that 
the Jordan-~Holder Theorem does not generalize to infinite 
groups. 


2 Solvable groups 


Recall that the subgroup G’ generated by the set of all commutators 
aba~'b~' in a group G is called the derived group of G. For any positive 
integer 7, we define the nth derived group of G, written G™, as follows: 


G”) = GC’, G™ = (G@-y (n > 1). 


Definition. A group G is said to be solvable ifG™ = (e) for some positive 
integer k. 


It is obvious that if G is abelian, then G’ = (e). Thus, trivially, every 
abelian group is solvable. 


2.1 Theorem. Lei G bea group. IfG is solvable, then every subgroup of 
G and every homamorphic image of G are solvable. Conversely, if N is a 
normal subgroup of G such that NandG/N are solvable, then G is solvable. 


Proof. Let G be a solvable group, so that G“) = {e). Then for any sub- 
group H of G, H™ c G™ for every positive integer n. Hence, H“ = {e). 
Therefore H is solvable. 

Let @: G — H be a surjective homomorphism. For any a,b € G, 


d(aba~'b-—') = d(a)h(b)d(a)~'d(b)-'. 


Hence, H’ = g(G’), and, by induction, H™ = ¢(G™) for any positive 
integer n. Therefore, H“) = d((e)) = {e}. This proves that every homo- 
morphic image of G 1s solvable. 

Conversely, let NV <) G such that N and G/N are solvable. Then N“™) = 
{e) and (G/N) = (e} for some positive integers k,/. Because G/N 
is a homomorphic image of G (under the natural homomorphism 
G—G/N), (G/N)™ = G”N/N for every positive integer . Hence, 
G CN. Therefore, G’+” Cc N® = (e} and G is solvable. O 


2.2 Theorem. A group G is solvable ifand only ifG has anormal series 
with abelian factors. Further, a finite group is solvable if and only if its 
composition factors are cyclic groups of prime orders. 


Proof. \f G = (e), then 
(e) = G*) CGF-YDC--CEGMC GM=G 
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is a normal series of G in which each factor G“~-/G™ is abelian. Con- 
versely, suppose G has a normal series 


(e}= H,CH,C:::CH,,CH,=G 


such that H,/H,_, is abelian, i = 1,...,r. Then by Theorem 1.4 of Chapter 
5, Hi C H,_,. Hence, G’ = H; C H,_,, and, by induction, G® C H,_,, 
i= |,...,r. Therefore, G® = (e}, and G ts solvable. 

Now suppose G is a finite solvable group. Then G has a normal series 
(e) = H, C:*:C H,=G in which each factor H,/H,_, 1s abelian. Now 
H,/H,-_,, being a finite abelian group, has a composition series whose 
factors are cyclic of prime order. Inserting the corresponding subgroups of 
H, between the terms H,_, and H, in the normal series Hy C --: C H,, we 
get a composition series of G in which every factor is a cyclic group of 
prime order. The converse follows from the first part of the theorem. © 


2.3 Examples 
(a) S; is solvable, for 
(e} CN =(e,(1 2 3),(1 32)} CS, 


is a normal series such that S,/N is abelian. 

We shall see in the next chapter that S, is solvable, but S,, 1 = 5, is not 
solvable. 

(b) The dihedral group D,, is solvable, for 


(e} C (e,9,...,.0"~'} C D,, 


is 2 normal series with abelian factors. 

(c) Let G be a finite solvable group and M be a minimal normal 
subgroup of G. Then M is cyclic of order p, for some prime p. We may 
first show that M is an abelian p-group. 


Problems 


1. Show that the group of all upper triangular matrices of the form 


1 a b 
0 1 cl], a,b,ceR, 


00 1 


is solvable. Generalize this ton x n upper triangular matrices. 
2. Show that a simple group is solvable if and only if it is cyclic. 
3. Show that if A, B are groups, then A x B is solvable if and only 
if both A, B are solvable. 
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4. Let A, B be normal solvable subgroups of a group G. Show that 
AB is also solvable. 


Additional problems on solvable groups are given in the problem set 
at the end of Chapter 8. 


3 Nilpotent groups 

We define inductively the nth center of a group G as follows. For n = 1, 
Z,(G) = Z(G). Consider the center of the quotient group G/Z,(G). Be- 
cause Z(G/Z,(G)) is a normal subgroup of G/Z(G), by Corollary 2.7 of 
Chapter 5 there is a unique normal subgroup Z,(G) of G such that Z,(G)/ 
Z(G) = Z(G/Z,(G)). Thus, inductively we obtain a normal subgroup 
Z,(G) of G such that Z,(G)/Z,_(G) = Z(G/Z,,_ (G)) for every positive 
integer 2 > 1. Z,(G) is called the nth center of G. Setting Z,(G) = (e)}, we 
have Z,(G)/Z,,_ (G) = Z(G/Z,,_ (G)) for all positive integers n. It follows 
immediately from the definition that 


ZAG) = {x € Gixyx7'y7' € Z,_,(G) for all yE G}. 


Hence, (Z,(G))’ C Z,_,(G). 
The ascending series 


(e} = Z(G) C Z(G)C--C Z,(G)C-:: 
of subgroups of a group G is called the upper central series of G. 


Definition. A group G is said to be nilpotent if Z,,(G) = G for some m. The 
smallest m such that Z,,(G) = G is called the class of nilpotency of G. 


If G isan abelian group, then Z,(G) = Z(G) = G. Thus, trivially, every 
abelian group is nilpotent. For a nontrivial example we prove 


3.1 Theorem. A group of order p" (p prime) is nilpotent. 


Proof. By Theorem 4.10 of Chapter 5, G has a nontrivial center Z,(G). 
Now G/Z,(G), being of order p’ < p”, has a nontrivial center. Hence, 
(Z,(G)|<|Z,(G)|. By repeating the argument we get |Z,,(G)| = p” for 
some msn. Hence, Z,(G)=G. O 


3.2 Theorem. A group G is nilpotent if and only if G has a normal 
series 

(e}=G,CG,C°:CG,=G 
such that G,/G,_, C Z(G/G,-_,) for all i = 1,...,m. 
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Proof. Let G be a nilpotent group of class mm, so that Z,,(G) = G. Then 
(e} = Z(G) C Z(G)C°:-C Z,(G)=G 


is a normal series that satisfies the required condition, since Z,(G)/ 
Z,-(G) = Z(G/Z,_ ,(G)). Conversely, suppose that G has a normal series 
(e} = GypC G, C*::C G,, = G such that G;/G,;_, C Z(G/G;_,) for all i. 
Then G, C Z(G). Now G,/G, C Z(G/G,). Hence, for every x € G,, yE 
G, xyx7'y~' EG, C Z(G). Therefore,G, C Z(G). By repeating the ar- 
gument we get G; C Z(G), i = |,...,a1. Hence, G = G,, C Z,,(G), and G is 
therefore nilpotent. O 


3.3 Corollary. Every nilpotent group is solvable. 


Proof. Let G be a nilpotent group. By Theorem 3.2, G has a normal series 
with abelian factors. Hence, by Theorem 2.2, G is solvable. 


The converse of this corollary is not true. For example, the symmetric 
group S§; is not nilpotent because its center is trivial. But it is easily verified 
that S®) is trivial. Hence, S; is solvable. Thus, we see that the class of 
nilpotent groups is contained strictly between the classes of abelian groups 
and solvable groups. 


3.4 Theorem. Let G be a nilpotent group. Then every subgroup of G 
and every homomorphic image of G are nilpotent. 


Proof. Let G bea nilpotent group of class m, so that Z,,(G) = G. Let Hbea 
subgroup of G. Then, clearly, HM Z(G) C Z(H). For all x € Z(G), 
and y€ G, xyx7'y~'! € Z(G). Hence, for all x E HN Z(G), and y € H, 
xyx7'y" E HO Z(G). Therefore. HM Z(G) C Z,(H). By repeating the 
argument we see that HM Z(G) C Z,(/7), i= l,....m. Hence, H = HN 
G=H1Z,(G)C Z,(1). Thus, H is nilpotent. 

Let d: G — H be a surjective homomorphism. Then 


b(xrx7' yo!) = A(x)b(V)O(0)7'HQQ)™! for all x,y € G. 
Hence, @(Z(G)) C Z(H). Let x € Z(G). Then x-x~'y~! € Z(G) for all 
y € G. Hence 

A(x)A(O(x)7'H(¥)7! € A(Z(G)) C Z(H). 


Because @ is surjective, it follows that (x) € Z(H). Therefore, 
d(Z,(G)) C Z(H). By repeated use of the argument we see that 
HZ (G)) C Z(H), i= 1,....m. Hence, H = 6(G) = G(Z,,(G)) C Z,,(H). 
Therefore, H is nilpotent. © 
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3.5 


H, X-- 
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Theorem. Let H,,...,H, be a family of nilpotent groups. Then 
> H,, is also nilpotent. 


Proof. We show that if H and K are nilpotent groups, then H X K is also 
nilpotent. The proof of the theorem then follows by induction. It can be 
easily shown that Z,(H X K) = Z,(H) X Z,(K) for any m. Hence, 
Z(H X K) = H X K for some m. Therefore, H X K is nilpotent. O 


Problems 


1. 
2. 
3 


4. 


Show that S, is not nilpotent for 1 = 3. 

is D, nilpotent? solvable? 

Give an example of a group G such that G has a normal subgroup 
H with both H and G/H nilpotent but G nonnilpotent. 

Let a group G be a product of normal subgroups H and K of 
orders p* and q’, respectively, where p and q are distinct primes. 
Show that G 1s nilpotent. 

Let G bea finite nilpotent group, and let p be prime. Show that ifp 
divides |G], then p also divides |Z(G)]. 


CHAPTER 7 


Permutation groups 


1 Cyclic decomposition 


Let us recall that a cycle o = (a, ---a,) in S, is a permutation such that 
a(a;) = a,,,fori=1,...,r—1,0(a,) = a,, and o(x) = x for every other x 
in n. Two cycles (a, ---a,), (b, ---b,) in S, are disjoint permutations if and 
only if the sets {a,,...,a,} and {b,,...,b,} are disjoint. Note that a cycle 
of length r can be written in r ways, namely, as (a,:--a,) and 
(a,a; 4 ,°°°@,a,°*:a;_,), i=2,...,r. A cycle of length r is also called an 
r-cycle. 


1.1 Theorem. Any permutation a € S, is a product of pairwise dis- 
joint cycles. This cyclic factorization is unique except for the order in which 
the cycles are written and the inclusion or omission of cycles of length |. 


Proof. We prove the theorem by induction on n. If n = 1 the theorem is 
obvious. Let i, € (1,2,...,2}. Then there exists a smallest positive integer r 
such that o"(i,) = i,. Let i, =o(i,), i; = o(i,) = 0(i,),....1, = O(i,-1) = 
o*—*(i,). Next, let 


X = (1,2,..651) — (dy sig yneesl,)- 


If X = ©, then oisacycle and weare done. Solet X # ©, and leta* = o[X. 
Then o* is a permutation of a set consisting of m — r elements. By induc- 
tion o* = CC, °°" C,,, where c, are pairwise disjoint cycles. But since o = 
o*c,, where c, =(i; i,---i,), it follows that o is a product of disjoint 
cycles. 

To prove uniqueness, suppose o has two decompositions into disjoint 
cycles of length > 1, say o= y, °*' », = B, °°: B 


129 


130 Permutation groups 


Let x € n. If x does not appear in any y,, then a(x) = x. Hence, x does 
not appear in any f,. If x isin some y,, then a(x) # x. Hence, x must be in 
some f;. But then a(x) occurs in both y, and f; for every r € Z. Hence, y; 
and £8; are identical cycles. This proves that the two decompositions writ- 
ten earlier are identical, except for the ordering of factors. O 


1.2 Corollary. Every permutation can be expressed as a product of 
transpositions. 


Proof. We need only show that every cycle is a product of transpositions. It 
is easily verified that 


(Ay Gz +++ An) = (Ay A_)(Ay Ay — )+*-(Ay 22). LJ 


Note that decomposition into transpositions is not unique, and the 
factors need not be pairwise disjoints. For example, 


(1 2 3 4 S)=(4 5X3 5X2 5X1 5) 
=(5 244 2X3 21 5) 
=(5 342 1X3 SY4 5X2 33 5) 


Let cE S,. Let us write the decomposition of @ into disjoint cycles, 
including cycles of length 1, and write the factors in order of increasing 
length. Suppose o = y, -:: y,, and the cycles 7,,...,y, are of lengths 7, ,...,7,, 
respectively. Thenn, Sn, S°°:sSn,andn, +n, +++: +n, =n. Thus,o 
determines a partition (7, ,...,,) of n, which is called the cycle structure of 
o. Conversely, given a partition (7,,...,7,) of n, there always exists (not 
uniquely) a permutation o € S, such that the cycle structure of a 1s 
(n,,...,7,), for the product of any disjoint cycles y,,...,y, of length 7, ,...,7,, 
respectively, satisfies that condition. 


1.3 Theorem. If.a,o € S,, then t= aoa"! is the permutation ob- 
tained by applying a to the symbols in o. Hence, any two conjugate per- 
mutations in S, have the same cycle structure. 

Conversely, any two permutations in S, with the same cycle structure 
are conjugate. 


Proof. \f o(i) = j, then ta(i) = (aca™')a(i) = aa(i) = af j). Therefore, if 
(a, ---a,,) 1S a cycle in the decomposition of a, then (a(a,;)---a(a,,)) 1s a 
cycle in the decomposition of t. Hence, the cycle decomposition of t is 
obtained by substituting a(x) for x everywhere in the decomposition of a. 
Thus, o and t have the same cycle structure. 
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Conversely, suppose that o and t have the same cycle structure 
(p,q,...,7). Then o and t have cycle decompositions 
0 =(a, **+Ap)(Ap4 1 *Apag)  (An—r et “y), 
t= (by ++ b,)(bp4 1° B pag) (On- 417 On): 
Define a € S, by a(a,) = b,, i = 1,...,2. Then it is easily verified that 


aca '(b,) = t(b,), i= 1,...,2. Hence, aca” ' = t, and ao and t are conju- 
gate. O 
1.4 Corollary. There is a one-to-one correspondence between the set of 


conjugate classes of S,, and the set of partitions of n. 


Proof. Obvious. O 


1.5 Examples 


(a) Express the following permutations as a product of disjoint cycles: 


Eero ee 
25 16 4 3 


(ji) (4 2 1 5X3 4 2 6X5 6 7 1) 
(ii) (; (773 
3412 5 


Solution. (i) We start with 1 and obtain the cycle (1 2 5 4 6 3). 
(i1) The given permutation can be rewritten as 


Starting with 1, we obtain the cycle (1 4). Then start with any other 
element notin(1 4), say 2. Starting with 2, we get acycle(2 6 7 5 3). These 
two cycles exhaust the list (1,2,3,4,5,6,7}. Thus, 0 = (2 6 7 5 31 4). 

(iii) Proceeding as before, we obtain that the given permutation is 
(1 3)(2 45) or, by the convention of omitting cycles of length |, equal to 
(1 3X2 4). 


(b) Leta =(1 5 3X1 2), t=(1 6 7 9). Compute ota™', 


Solution. By Theorem 1.3, ota~' = (a(1) a(6) 0(7) 0(9)) = (2 6 7 9). 
The interested reader may actually verify this by first computing o~'. 


Thus, by composing the mappings we obtain 


,.(1 2345678 9\_ 
ota Sees 44. (26 7 9). 


Problems 


If a and £ are disjoint cycles, then af = Ba. 

2. Ifa@isacycle of length 7, then a’ is the identity permutation. 

3. Show that the order of anya € S, is the least common multiple of 
the lengths of its disjoint cycles. 

4. Any two cycles in S, are conjugate if and only if they are of the 

same length. 


2 Alternating group A, 


We showed in Section | that every permutation in S, can be expressed as a 
product of transpositions, and the number of transpositions in any given 
product is not necessarily unique. However, we have the following im- 
portant result showing that the number of factors is always even or always 
odd. 


2.1 Theorem. /fa permutation o € S,, is a product of r transpositions 
and also a product of s transpositions, then r and s are either both even or 
both odd. 


Proof. Let o = n,°°* n, = 1,°°° ,, where n, and nj are transpositions. 
Let P = Il,<,(x, — x,) be a polynomial in the variables x,,...,x,. Define 


n(P) = [Gunn — Xmyy)s 
i<j 

where 7 € S,. We show that if 7 is a transposition, then n(P) = — P. Let 
n= (k 1),k <1. Nowone of the factors in the polynomial Pis x, — x;,and 
in n(P) the corresponding factor becomes x, — X,. 

Any factor of P of the form x, — x,, where neither i nor jis equal to k or 
/, is unaltered under the mapping 7. All other factors can be paired to form 
products of the form +(x, — x,)(x, — x), with the sign determined by the 
relative magnitudes of i, k, and /. But since the effect of 7 is just to 
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interchange x, and x,, any such product of pairs is unaltered. Therefore, 
the above argument gives that the only effect of 7 1s to change the sign of P. 
This proves our assertion. 

Finally, 


o(P) = (n° n,)P = (— LY. 
Also 
o(P) = (m+: m)P = (— IP. 
Thus (— 1) = (— 1), proving that r and s are both even or both odd. O 


Definition. A permutation in S, is called aneven (odd) permutation ifit is 
a product of an even (odd) number of transpositions. 


The sign of a permutation o, written sgn() or €(0), is defined to be + 1 
or — |, according to whether a is even or odd. More generally, we define 
the sign of any mapping from n to n as follows. 


Definition. Let d:n—n. Then 


+1 if @ is an even permutation, 
e(d) = 4-1 if @ is an odd permutation, 
0 if dis not a permutation. 


2.2 Lemma. Let ¢,y be mappings from n ton. Then 


e(py) = e(d)e(y). 
Hence for anya € S,, e&(a—') = €(0). 
Proof. If d and y are both permutations, the result follows from Theorem 
2.1. If f or wis not a permutation, then @y is not a permutation. Hence, 


e(py) = 0 = e(d)e(y). If o is a permutation, €(a)e(o—') = e(o0-') = I. 
Hence, €(o~') = €(c). O 


Let A, denote the set of all even permutations in S,. 


2.3 Theorem. A, is anormal subgroup of S,. Ifn > 1, A, is of index 2 
in S,, and, hence, |A,| = 4n!. 


Proof. lf n= 1, S,; = {e} = A,. Hence, A, is trivially a normal subgroup. 
Let n> 1. Let G=(1,—1) be the multiplicative group of integers 1,— 1. 
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Consider the mapping 
@:S,—-G  givenby o+>«€o). 


By Lemma 2.2, ¢ is a homomorphism. Because n > 1, the transposition 
t=(1 2)isin S, and @(t) = — |. Hence, @ is surjective. Therefore, by the 
fundamental theorem of homomorphisms, S,/Ker @ = G. But Ker @ 
is precisely the set A, of all even permutations. Hence, A, is a normal 
subgroup of S,, and [S,:A,] =|G|= 2. Consequently, |A,|= 4|S,|= 
jn!. O 


Definition. The subgroup A,, of all even permutations in S,, is called the 
alternating group of degree n. 


2.4 Examples 


(a) In S; the identity and the cycles (1 2 3), (1 3 2) are the only even 
permutations. Hence, 


A, = ((1),(1 2 3),(1 3 2)). 


(b) A, is nonabelian if 2 > 3. For, if a, 5, c, d are distinct, then 
(abcXabd)=(acXbd), whereas (abd\abc)=(ad\Xbc). Hence, 
(abclfabd)#(abd\a bc). 

(c) The only possible even permutations in S, are 3-cycles and products 
of two transpositions, in addition to the identity. Hence, A, consists of the 
following elements: 


e=(1), a=(1 2X3 4) b=(1 3X2 4), c=(1 42 3), 
a=(1 2 3) B=(l 3 4), y= 4 2) d=(2 4. 3), 
a?=(1 3 2) B2=(1 4 3), y2?=(1 2 4), d2?=(2 3 4). 


It is easily verified that a, b, and c are conjugate to one another. For 
example, aaa~! = c. Similarly, a, 8, y, and 6 are conjugate elements, and 
so are a?, 82, y?, and d?. Thus, the conjugate classes in A, are {e), (a,b,c), 
{a,8,y,0), and {a?,B?,y?,d2). 

Note that two elements of A, may be conjugate in S, but not in A, . For 
example, a and a’, being of the same length, are conjugate in S, but notin 
Ay. 

(d) A, has only one proper normal subgroup, which is the Klein four- 
group. Consequently, A, has no subgroup of order 6. (This example shows 
that the converse of Lagrange’s theorem does not hold.) 

With the same notation as before, we easily verify that a? = b? = 
c? = e, ab=c, bc =a, ca = b. Hence, H = {e,a,b,c) is a subgroup of A,. 
Moreover, since a, b, and c constitute a conjugate class, H is normal. It is 
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clear from the relations satisfied by a, b, and c that H is the Klein four- 
group. 

Let K be any normal subgroup of A,. If x € K, then x? € K, and 
because K is normal, every element conjugate to x or to x? is also in K. 
Hence, if K contains any of the elements @,8.6,),a7,87,72,62, then K must 
contain all of them. But then |K|> 8. Hence, by Lagrange’s theorem,K = 
A,. This proves that H = (e,a,6,c} is the only normal subgroup of A,. 

Finally, any subgroup of order 6 of A,, being of index 2, would be a 
normal subgroup. Hence, A, has no subgroup of order 6. 


3 Simplicity of A, 


3.1 Lemma. The alternating group A, is generated by the set of all 
3-cycles in S,,. 


Proof. The result holds trivially for n = 1,2, for then A, = (e} and S,, does 
not have any 3-cycle. So let n > 2. Now every 3-cycle is an even permuta- 
tion and, hence, an element of A,. Conversely, every element in A, is a 
product of an even number of transpositions. Consider the product of any 
two transpositions o and t. If cand 7 are disjoint, say o = (a 6), t = (c a), 
then ot = (a b cb c ad). Otherwise, o = (a b), t = (b c)givesot = (abc). 
Thus, every even permutation 1s a product of 3-cycles. Hence, A, is gener- 
ated by the set of all 3-cyclesin S,. © 


3.2 Lemma. The derived group of S,, is A,. 


Proof. For n = 1,2, S) = (e) = A,. Consider n > 2 and let a = (1 2) and 
B= (1 2 3). Then 


afa'B'=(1 21 2 342 1X3 2 I=(1 2 = 3). 


Hence, (1! 2 3) € S). Because S;, < S,, S;, must contain every conjugate of 
(1 2 3). Hence, S’ contains every 3-cycle; therefore, A, C S?,. On the other 
hand, every commutator aba~'b~' in S/ is an even permutation. Hence, 
S;,C A,. This proves that S$} =A,. O 


3.3 Theorem. The alternating group A, is simple if n> 4. Conse- 
quently, S, is not solvable ifn > 4. 


Proof. Suppose H is a nontrivial normal subgroup of A,. We first prove 
that H must contain a 3-cycle. Let 0 # e be a permutation in H that 
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moves the least number of integers in n. Being an even permutation, ¢ 
cannot be a cycle of even length. Hence, o must be a 3-cycle or have a 
decomposition of the form 


o=(a b Cc seeder (1) 
or 
a=(a bXc ad):::, (2) 


where a,b,c,d are distinct. Consider first case (1). Because o cannot be a 
4-cycle, it must move at least two more elements, say d and e. Let a = 
(c d e). Then 


aca '=(c d ela bc -++)-(e d oc) 
_— (a b d eee ) coe 
Now let t = 0 '(aoa—'). Then t(a) = a, and t(x) = x whenever o(x) = x. 
Thus, t moves fewer elements than o. But t € H, a contradiction. 
Consider now case (2). With a = (c d e) as before, 


aca”“'=(c d eXa bXc d)-::(e d oc) 
=(a bd e)-::. 


Let B = o~(aoa™'). Then B(a) = a, 8(b) = b, and for every x other 
than e, B(x) = xifo(x) = x. Thus, 8 € Hand moves fewer integers than a, 
a contradiction. Hence, we conclude that o must be a 3-cycle. 

Let t be any 3-cycle in S, . Because any two cycles of the same length are 
conjugate in S,,t = aoa™' forsomea € S,. Ifa is odd, choose a transpo- 
sition £ in S, such that o and £ are disjoint. (This is possible because 
n> 4.) Then af € A,, and 

(af o)(aB)~' = af BoB-')ar! = acaé' = t. 
Hence, o and f are conjugate in A,. Therefore, t © H. Thus, H contains 
every 3-cycle in S,. Therefore, H = A,. Hence, A, has no proper normal 
subgroup, which proves that A, is simple if 2 > 4. 

Consider now the derived group of A,,. Because A}, is a normal sub- 
group of A,, and A, is simple, either A), is trivial or A, = A,. But A, is 
nonabelian if n > 3. Hence, A} is not trivial. Therefore, A, = A, for every 
n> 4. Consequently, S2) = A‘ = A,. Hence, S = A, for all k = 1. This 
proves that S, is not solvable. O 


3.4 Example 
A,, "> 4, is the only nontrivial normal subgroup of S,. 
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Solution. Let (e} # H<S,. Suppose HN A, # (e}. Then HN A, 4A, 
gives HM A, = A, because A, is simple. This implies A, C H. But then 
|S,,/A,|™= 2 implies |H/A,| = 2, so H = A, or S,. 

If HNA, = {e}, then |H A,| = 4H|n! s n!. Thus, |H|s 2 or |H|= 2 
because H # (e}. If H = (e,0), then non—'! = a Wn & S,. This implies o € 
Z(S,) = {e}, acontradiction. Hence, the only nontrivial normal subgroup 
of S,, is A,. 


Problems 
1. Prove that S, is generated by the cycles (1 2...n— 1) and 
(n— 17). 
2. LettH<S,,H¢ A,. Show that exactly half the permutations in 
H are even. 


3. Show that any group of order 4n + 2 has a subgroup of index 2. 

4. Let S, act on itself by conjugation. Let K and H be, respectively, 
the orbit and stabilizer of (1 2)€S,. Show |K|=10 and 
|H| = 12. 

5. Show that there are 420 elements of S, having disjoint cyclic 
decomposition of the type (a b)(c d_ e). Find also the 
number of elements in the orbit of (a b)(c d_e). 

6. Show that there are 1120 elements of S, having disjoint cyclic 
decomposition of the type (a b c)(d e f)(g_ hi). 

7. Consider a cube with vertices numbered | through 8. Let H be 
the subgroup of the group G of rotations and reflections 
keeping the vertex | fixed. Show H =~ S, and |G| = 48. 

8. Consider a regular tetrahedron with vertices numbered ! 
through 4. Let H be the subgroup of the group G of rotations 
and reflections keeping one vertex fixed. Show H~=S, and 
(G| = 24. 

9. Prove that every finite group is isomorphic to a subgroup of 
the alternating group A, for some n> I. 


CHAPTER 8 


Structure theorems of groups 


1 Direct products 


If a group is isomorphic to the direct product of a family of its subgroups 
(known as summands) whose structures are known to us, then the struc- 
ture of the group can generally be determined; these summands are like 
the “‘building blocks” of a structure. Theorem 1.1 answers the question, 
When is a group isomorphic to the direct product of a given finite family 
of its subgroups? 


1.1 Theorem. Let H,,...,H,, bea family of subgroups ofa group G, and 
le H=H, -:: H,. Then the following are equivalent: 


(i) H,™X +++ *H,=H under the canonical mapping that sends 
(Xp e0eyXp_) LOX, °°° Xy- 
(it) H,<H, and every element x © H can be uniquely expressed as 
X=X, °° X,,.%,E M,. 
(iii) H,<AH, and if x, «++ x, =e, then each x,= e. 
(iv) H,AH, and H,N(H, +: A, Ai, °° A) = (e), bsisn. 


Proof. (i) = (ii) Let Hj = ((e,...,4,,...,.e)|A, € H,). Trivially, Hid H, X 
-->  H,. Also H;~=H, under the given mapping. Thus, H,< 
H, -* H,. 

Further, let x = x, °°: X, =X} °° X4,xX,,X;E€ H;. Then (x),...,x,) 
and (x},...,X,) have the same image under the given isomorphism. Hence, 
X; = Xj. - 

(11) => (i11) x, *°* x, =e. Also e :*: e = e. By uniqueness of represen- 
tation, x,=e,lsisn. 
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(iii) => (iv) The hypothesis implies 1,9 H;={e), i #Jj. For let 
X, = X,,X, © H,, x, € H,. Then e = x,x7' implies x, = e = x; '. But then, 
since H,,H, <1 H, it follows that xy = yx for all x © H;, y € H;. 

To prove (iv), let 


Xp My XX ian 0 Xny 


where x;€H,, | <i<n. Then since all x;s commute, e= 
Xt Me pXT 4, °° X,, $0 by (iii) each x; = e. This proves (iv). 

(iv) => (i) First observe that for all x € H;, y € H,,1 # j, xy = yx. Then 
it follows immediately that the mapping o: H, X -:- X H, — H, given by 
O(X, y-..X,_) = X, °** X,, 1S a homomorphism. The mapping ts injective; 
for let x, -°: x, =e. Then xj7' =x, °°: x,. This implies by hypothesis, 
that x; ' = e,so.x, = e. The mapping is clearly surjective, proving (i). © 


Under any one of the equivalent statements in Theorem 1.1, we say 
that //is an internal direct product of H,,...,H,,. The emphasis on the word 
internal is more psychological than mathematical because the ‘‘external 
direct product” H, X ::- X H,,is isomorphic to H, -:+ H, in our situa- 
tion. Thus, we shall generally omit the word “internal’’ or “external” 
whenever the subgroups /,,...,H, satisfy any one of the conditions 
in Theorem 1.1. Note, however, that the external direct product 
H, X ++: X H,, always exists, whereas the internal direct product exists if 
and only ifthe canonical map H, X +: X H,— H, °:* H,,isanisomor- 
phism. 

If the group is additive, the (internal) direct product of subgroups 
H,,...,—,, of G is also written as H, © --- © H,,and called the direct sum 
of H,,...,H,,. 


1.2 Examples 


(a) If each element #e of a finite group G is of order 2, then 
|G|= 2" and G~C, x C, x --- x C,, where C; are cyclic and |C,| = 2. 


Solution. Now x? = e=> x = x7! for all x € G. So if a,b € G, then ab = 
(ab)~' = b-'a~' = ba. Let e #.a, EG. Then either G = [a,]. the cyclic 
group generated by a, ,or G ) [a,]. In the second situation, 3 a, € Gsuch 
that a, ¢ [a,]. Then the product [a,][a,.] is direct. If G = [a,][a,], we are 
through; otherwise G > [a,][a,] > [a,]. Continuing thus, we ultimately get 
G = [a,]{a,] --- [a,]. the direct product of cyclic groups each of order 2. 


(b) A group G of order 4 1s either cyclic of G~ C, x C), a direct 
product of two cyclic groups each of order 2. 
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Solution. If G is not cyclic, then by Lagrange’s theorem G must contain an 
element a of order 2 and also another element 6 of order 2. Obviously, 
G~[a] x [6]. 


(c) If Gis agroup of order pg, where pand gare distinct primes, and if G 
has a normal subgroup H of order p and a normal subgroup K of order gq, 
then G is cyclic. 


Solution. By Lagrange’s theorem HK = {e}. Since H and K are 
normal and Ho K = {e}, hk =kh for all he H and keK. Also then 
HK~HxK. If H=[h] and K =[k], then o(hk) = o(h)-o(k) = pq. 
Hence G is generated by hk. 


(d) If G is a cyclic group of order mn, where (m,n) = 1, then 
G =~ H x K, where H is a subgroup of order m, and K is a subgroup of 
order n. 


Solution. Because G is cyclic, there exist a unique subgroup // of order 
and a unique subgroup K of order n. By Lagrange’s theorem H NM K = (e}. 
Then by Example 3.11(e) in Chapter 4, |HK|=|H||K|= mn. Hence, 
G=HK ~H x K. 


(e) If Gisa finite cyclic group of order n = p{' -*: p§*, p, distinct primes, 
then G is a direct product of cyclic groups of orders pf, | s isk. 


Solution. Proceed as in (d). 


Problems 
I. Show that the group Z/(10) is a direct sum of H = {0,5) and K = 
(0,2,4,6,8). 


2. Show that the group (Z/(4), +) cannot be wnitten as the direct 
sum of two subgroups of order 2. 

3. Show that the group (Z/(8), +) cannot be written as the direct 
sum of two nontrivial subgroups. 

4. LettNI G=H~*X K.Prove that either Nis abelian or N intersects 
one of the subgroups H x (e), (e) X K nontrivially. 
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2 Finitely generated abelian groups 


An important question in the study of an algebraic structure is to know if 
it can be decomposed into a direct sum of certain “well-behaved” direct 
summands. In this section we show that any finitely generated abelian 
group can be decomposed as a finite direct sum of cyclic groups. This 
decomposition, when applied to finite abelian groups, enables us to find 
the number of nonisomorphic abelian groups of a given order (Section 3). 
We may remark that not every finitely generated group is finite (consider, 
for example, (Z,+)). 


2.1 Theorem (fundamental theorem of finitely generated abelian 
groups). Let A be a finitely generated abelian group. Then A can be 
decomposed as a direct sum of a finite number of cyclic groups C,. Pre- 
cisely, 


A=C,® our ©C,, 


such that either C,,...,C, are all infinite, or, for some j s k, C,,...,C, are of 
order m,,...,m,, respectively, with m,|mz|---|m,,and C,,,,...,C, are infi- 
nite. 


Proof. Let k be the smallest number such that A is generated by a set of k 
elements. We prove the theorem by induction on k. Ifk = 1, A isacyclic 
group; hence, the theorem is trivially true. Let k > 1, and assume that the 
theorem holds for every group generated by a set of k — 1 elements. 

First consider the case when A has a generating set (a, ,...,a,} with the 
property that, for all integers x,,...,x,, the equation 


Xa, + + +xX,a,=0 
implies 
X,™ os =x, = 0. 
This implies that every a € A has a unique representation of the form 
a=xX,a, + *- +x,a,, x,€ Z. 
For if 
A=X,A, + * XA, = XA, + 8 + X,Ay, 
then 


(x, — x{)a, Foe +X — Xa, = 0. 
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Hence, x,;=x;, (= 1,...,k. Therefore, by Theorem 1.1, A= 
C,@ +++ © C,, where C; = [a,] is the cyclic subgroup generated by a,, i = 
I,...,k. Moreover, since x,a,; = 0 implies x; = 0, C, is an infinite cyclic 
group. Hence, in this case, A is the direct sum ofa finite number of infinite 
cyclic subgroups. 

Now suppose that A has no generating set of k elements with the 
property stated in the last paragraph. Then, given any generating set 
(a,,...,€,}) of A, there exist integers x,,...,x, (not all zero) such that 
X,a,+ +++ +x,a, = 0. Since 2x,a; = 0 = 2(—x,)a; = 0, we can assume 
that x; > 0 for some /. Consider now all possible generating sets of A with k 
elements, and let_X denote the set of all k-tuples (x, ,...,.x,) of integers such 
that 


XQ, ++: + x,a, =0, x; >0 for some i, 
for some generating set (a,,...,a,} of A. Let m, be the least positive integer 
that occurs as a component in any k-tuple in_XY. Without loss of generality, 
we may take m, to be the first component, so that, for some generating set 
(GQ, 5---,4,), 

ma, + Xa, + ++: +x, a, =0. (1) 
By the division algorithm we can write 

X;= qm, +7, where 0 Sr; < m,, 
for each i = 2,,...,k. Then (1) becomes 

m,b, + rea, + as + r,a, = 0, (2) 


where 5, =a, + q,a, + --- +q,a,. Now 6, #0, for otherwise a, = 
— GQ, — *** —q,a,, which implies that A is generated by a set of k — | 
elements, a contradiction. Moreover, a, = b, — q,.Q,— °*' — QQ. 
Hence, (5, .a2,...,a,} iS a generating set of A. Therefore, by the minimal 
property of m,, it follows from (2) that r,= --- =r,=0. Hence, 
m,b, = 0. Let C, =[5,]. Because m, is the least positive integer such that 
m,b, =m,b, + 0a, + --+ +0a, = 0, C, is acyclic subgroup of order 771, . 

Let A, be the subgroup generated by (a,,...,a,}. We claim that 
A=C,@A,. So suppose x,b, € A, for some x,, 0S x, < m,. Then 
XO, = XpA_ Hes HXAy, Xz,...,X%, € Z. Hence, x,b, — x.a,—- ++ 
x, 4, =0, which implies x, =0 by the minimal property of m,. Therefore, 
C, OA, = (0), which proves our claim that A = C, © A,. 

Now 4, is generated by a set of k — | elements, namely, (a),...,a,). 
Moreover, A, cannot be generated by a set with less than k — | elements, 
for otherwise A would be generated by a set with less than k elements, a 
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contradiction. Hence, by the induction hypothesis, 
A,=C,9@- OG, 


where C,,...,C,, are cyclic subgroups that are either all infinite or, for some 
jsk, Cy,...,C; are finite cyclic groups of orders m),...,m,, respectively, 
with m,|m3|---|m,, and C; are infinite for i> j. 

Let C,; = [b,, 1 = 2,....k, and suppose that C, is of order m,. Then 
(b,,...,5,} iS a generating set of A, and 


mb, + mb, + 0b, + -°: + 0b, = 0. 
By repeating the argument given for equation (1), we conclude that 
m,|m,. This completes the proof of the theorem. © 


Evidently, if A isa finite abelian group, then C,,....C, are all finite. We 
prove in the next section that for a given A the positive integers /71,,..., 771; 
are determined uniquely. 


3 Invariants of a finite abelian group 
Throughout this section A denotes a finite abelian group written addi- 
tively. 


3.1 Theorem. Let A be a finite abelian group. Then there exists a 
unique list of integers m,,...,m, (all > 1) such that 


|A| =m, °° Mm, m,|m| -*° |m,, 


andA=C,® ++ OC,, whereC,,...,C, are cyclic subgroups of A of order 
mM,,....M,, respectively. Consequently, 


A=Z,, 9° ©Z,,. 
Proof. By Theorem 2.1, A =C, © --- © C,, where C,,...,C, are cyclic 


subgroups of orders m,,...,m,, respectively, such that m,|77,| -°- |/m,. 
Further, since |S < 7|=|5S]|7| for any finite sets S and 7 it follows that 


[AJ= 1Cyl os [Cal = my vr ty. 
Moreover, any cyclic group of order m is isomorphic to Z,,. Hence, 
A=Z,,,9O °: OZ. 
To prove the uniqueness of the list 772,,...,771,, Suppose 
A=C,@:-®C,=D,@---@D,, 
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where C,,D, are all cyclic subgroups of A, with 
IC,| =m,, m,|m2|---|m,, 
[Djj=nj, = my |n2|---[m. 


Now D, has an element of order 7,. But every element in A is of order 
< m,. Hence, n, S m,. By the same argument m, S n,. Hence, m, = n;. 

Now consider m,_,A = {m,_,ala © A}. From the two decompositions 
of A assumed above, we get 


mA = (m_,C1) © -° O(mM_,C,-,) © (M_1C) 
= (m,-,D,)@© +** © (m_,D)-1) © (m_,D). 


Because m,|m,_, for i= 1,....4 —1, it follows that m,_,C;, is trivial 
for i= 1,....4—1. Hence, |m,_,Al=|m,—,C,|=|m,_,D,|. Therefore, 
\m,—,D,\= | for j = 1,...,./— 1. Hence, m,_,|m,—,. By a symmetric argu- 
ment, m,_,|",-,. Therefore, m,_, = n;_,. Proceeding in this manner, we 
can show that m,_, = n,_,,7 = 0,1,2,.... Butm, °°: m, =|Al=n, o* Mm. 
Hence, k = / and m,=7,, i= 1,...,.k. O 


Definition. Let A be a finite abelian group. If 
A=Z,,,8---OLZm,, where 1<m,|m2|---|m, 


then A is said to be of type (m,,...,m,), and the integers m,,...,m, are called 
the invariants of A. 


The fundamental theorem of finite abelian groups enables us to deter- 
mine the number of isomorphism classes of abelian groups of order 7. A 
partition of a positive integer k is an r-tuple (kK, ,...,k,) of positive integers 
such that k = 2k, and each k, S k,,,. 


3.2 Lemma. There is a |-1 correspondence between the family F of 
nonisomorphic abelian groups of order p*, p prime, and the set P(e) of 
partitions of e. 


Proof. Let A & F. Then A determines a unique type (p“.,..., p*), where 
éjSe,S °°: Se, and e, te,+ -:: +e,=e. Define a mapping a: 
F — P(e) by oa(A) = (e,,...,&). iS Clearly injective (Theorem 3.1). To 
show that o@ is surjective, let (eé,,...,e,)€ P(e). Then the group 
Zy. © °** © Z,,, is obviously the preimage of (@,,...,e,). This completes 
the proof. O 
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The following lemma is an immediate consequence of Sylow’s 
theorems discussed in the next section. However, we give a proof based on 
the fundamental theorem of finitely generated abelian groups. 


3.3 Lemma. Let A be a finite abelian group of order p§' --- p&, p; 
distinct primes, e,>0. Then 


A =S(p:)®---BS(p_), where |S(p;)| = p7'. 
This decomposition of A is unique; that is, if 
A=H,@®---®H,, where |H;| = pj', 
then H; = S(p,). 
Proof. We have |A| = p{'-:- py", e: = 0. By Theorem 3.1, A=A,@°:: 
®A,, where A; are cyclic groups of orders, say, p{'---py*', ej = 9. 


Because A, is cyclic, A; contains unique subgroups A,,,..., A,,; of orders 
Di'\,.++>De's respectively. By Lagrange’s theorem 


Ay QV Ay o> Aji Ajoia °°* Ans = (e) 
for all j = 1,...,k. Therefore, by Theorem 1.1, 
A, = A,,® +++ O Ay. 
So 
A=[A,108--- OA J®---O[A1,B8 > @ Aga). 
Put 
S(D,) = [Ai © ++: OA,,),..., SC) = [Agr © +°* © Ay): 


Then |S(p,)| = p?.,..., |S(p)|= p#, and A = S(p,) © +--+ © S(p,). Then 
by comparing orders on both sides, it follows that |S(p,)| = p%,..., 
|\S(p,)| = pf. The last statement can be proved easily by observing that 
each of the subgroups S(p,) and H, is the subgroup 


(x € Alo(x) is a power of p,}. oO 


3.4 Theorem. Let n = If, pf, p, distinct primes. Then the number of 
nonisomorphic abelian groups of order n is If. ,|P(f)\. 


Proof. Let Ab, be the family of nonisomorphic abelian groups of order n. 
Let A € Ab,. By Lemma 3.3, A = S(p,)@ -*+ © S(p,). By Lemma 3.2 
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the number of nonisomorphic abelian groups S(p,) is |P(/;)|. Hence, 


k 
IAB, = [IPD = 9 
fm} 


3.5 Example 
Find the nonisomorphic abelian groups of order 360 = 23 - 3? - 5!. 


Solution. n = 360 = 23 - 32 - 5'.|P(3)| = 3, |P(2)| = 2, |P(1)| =_1. Hence, 
there are six nonisomorphic abelian groups of order 360, and these are 


7,92, ©Z,, 
Z,®Z,©Z,@Zz, 
2,902,092, OZ, 

Z, © Z,® Z, @ Z,@ Zz, 
2,02,072,0Z,0Z;, 
Z,® Z,®@Z,@Z,@Z,@ Zs. 


4 Sylow theorems 


In the last section we obtained the decomposition ofa finite abelian group 
A that provides a complete description of the structure of A. There is no 
such general result for finite nonabelian groups. The Sylow theorems, 
which occupy an important place in the theory of finite groups, are a 
powerful tool in studying the structure of finite nonabelian groups. In 
particular, the existence or nonexistence ofa simple group ofa given order 
can sometimes be determined by application of the Sylow theorems, 
among other machinery. 


Definition. Let G be a finite group, and let p be a prime. Let p™|\G\, 
p™*'Y|G|, m > 0. Then any subgroup of G of order p™ is called a Sylow 


p-subgroup of G. 


Definition. Let p be prime. A group G is called a p-group if the order of 
every element inG is some power of p. Likewise, a subgroup H ofany group 
G is called a p-subgroup of G if the order of every element in H is some 


power of p. 


4.1 Lemma (Cauchy’s theorem for abelian groups). Let A be a finite 
abelian group, and let p be a prime. If p divides|A|, then A has an element 
of order p. 
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Proof. We prove the theorem by induction on 7, the order of A, where pjn. 
Suppose that the result is true for all groups of orders m, p|m, and m <n. If 
| A} = por even if A is a cyclic group, then there exists a cyclic subgroup of 
A of order p, and therefore there exists an element of order p. Suppose now 
there exists b(# e) € Asuchthat A # [5], the cyclic subgroup generated by 
b. If p|{{b]|, we are done. But if p Y|[]|, then p divides the order of A/[5]. 
Hence, by the induction hypothesis there exists a € A/[b], a € A, such 
that o(a) = p. Let k = o(a). Then a‘ = e implies a* = e. Therefore, p\k, 
since o(a) = p. This means p divides the order of [a]; hence, some power 
of the element a is of order p. O 


4.2 Theorem (first Sylow theorem). Let G be a finite group, and let p 
be a prime. If p™ divides |\G\|, then G has a subgroup of order p™. 


Proof. We prove the theorem by induction on n =|G|. Ifn = 1, the result 1s 
obvious. So let us assume that the result holds for all groups of orders less 
than 7. 

If the order of the center of G is divisible by p, then it follows from 
Cauchy’s theorem for abelian groups that the center of G contains an 
element a of order p. The cyclic group C generated by a is clearly normal 
in G, and the quotient group G/C has order n/p, which is divisible by p”~'. 
Therefore, by the induction hypothesis, G/C contains a subgroup H/C of 
order p”—'!, proving the theorem in this case. 

Now consider the case when the order of the center is not divisible by p. 
The class equation of G is 


n=|G|=|Z(G)|+ DIG: Ma), 


where summation runs over one element from each conjugate class hav- 
ing more than one element. Now pjn, but p}/|Z(G)|. Hence, p}[G: N(a)] 
for some a € G, a €¢ Z(G). This implies that p™||M(a)| and |N(a)| <{GI. 

By the induction hypothesis (a) has a subgroup of order p”, and this 
yields the required subgroup of G. © 


A special case of the theorem is 


4.3 Corollary (Cauchy’s theorem). /f the order of a finite group G is 
divisible by a prime number p, then G has an element of order p. 


Proof. By Sylow’s theorem there exists a subgroup of order p. Therefore, 
there are at least p — | elements of order p (namely, all but the identity 
element of the cyclic subgroup of order p). O 
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4.4 Corollary. A finite group G is a p-group iff its order is a power of p. 


Proof. \f G is of prime power order, then each element has, trivially, order 
a power of p. So let us assume that each element in G has order a power of p 
and prove that order of G is a power of p. If possible, let g # p be a pnme 
number dividing the order of G. Then by Cauchy’s theorem there exists an 
element of order g, a contradiction. Hence, G is of order a power of p. O 


4.5 Theorem (second and third Sylow theorems). Let G be a finite 
group, and let p bea prime. Then all Sylow p-subgroups of G are conjugate, 
and their number n, divides o(G) and satisfies n, = | (mad p). 


Proof. Let K be a Sylow p-subgroup in G. Let C(K) denote the family of 

subgroups of G conjugate to K; that is, C(K) = {(xKx7~'|x € G). Then 
|C(K)| =|G/MK )|. (1) 

Because K is a Sylow p-subgroup, (|G/M(K)|,p) = 1; therefore, 
(J\C(K)I,p) = 1. 


Next let H be any Sylow p-subgroup. We want to show that // is 
conjugate to K. Regard the set C(K) as an //-set by conjugation. Then for 
any LE C(K), let C,,(L) = {ALA“"|h € H)}, the orbit of L. Obviously, 
C,,(L) C C(K), and 


C(K)= U C,(L) — (disjoint), (2) 
LECIK) 
where the union runs over one element L from each orbit [= conjugate 
class C,,(L)]. 
By Example 4.13 (b) in Chapter 5, 
|C(L)| = index of 1M ML) in H = p*, e= 0, (3) 


because H is Sylow p-subgroup. We first show that p* = 1 iff 77 = L. If 
H = L, then, trivially, p* = 1. Conversely, 


pP=1>H=HAML)-HCML)—AL<GandLIHA_e. 
But then 
HL/L= H/HO L=> HL/L isa p-group > HL = L, 


since L is a Sylow p-subgroup. So our assertion that p* = | iff H = L is 
established. 
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By (2) and (3) we have 


IC(K)| = Spt. (4) 


But then (1) implies that in the summation {p*¢ at least one term p* must 
be equal to 1, and this in turn implies, by what we established 1n the last 
paragraph, that H must be equal to some conjugate L of K, proving the 
second Sylow theorem. 

Because all Sylow p-subgroups of G are conjugate to K, it follows that 
their number n, is equal to |C(K)|=|G|/|N(K)|. Hence, n,}|G}. 

It is also clear that there is one and only one term in the sum 2p* that is 
equal to 1, so we have from (4) that the number of distinct conjugates of K 
1S 

nj=1+ ¥pt=1+kp=1 (mod p), 
e>0 


proving the third Sylow theorem. O 


4.6 Corollary. A Sylow p-subgroup of a finite group G is unique ifand 
only if it is normal. 


4.7 Examples 


(a) If a group of order p” contains exactly one subgroup each of orders 
D,p?,..., p"', then it is cyclic. 


Solution. Let G be a group of order p”, and let H be a subgroup of order 
p"-'. By Sylow’s first theorem H contains subgroups of orders 
D, p’,..., p"- 2. Because these subgroups are also subgroups of G, it follows 
that all the proper subgroups of G are contained in H. Now if a€& G, 
a & H, then the order of a must be p”, otherwise a will generate a subgroup 
K of order < p”. This would mean a € H, because a € KC H. Thus, the 
order of a is p", and G is therefore a cyclic group generated by a. 


(b) If d is a divisor of the order n of a finite abelian group A, then A 
contains a subgroup of order d. 
Solution. If n= pf! +++ pé, then A = S(p,)® +:: © S(p,), and 
(S(p) |= pf, 1sisr (Lemma 3.3). 


Let d= ph ++ pf&, By Sylow’s first theorem each S(p,) contains a 
subgroup S’(p,) of order p’. Then B= S’(p,)@ +++ © S’(p,) is a sub- 
group of order d. 
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Note. Example (b) proves the converse of Lagrange’s theorem for abe- 
lian groups. 


(c) Prove that there are no simple groups of orders 63, 56, and 36. 


Solution. The number of Sylow 7-subgroups is n, = 1 + 7k, and 1 + 7k 
divides 32 - 7. But then | + 7k divides 32. This yields k = 0. Therefore, 
there is a unique Sylow 7-subgroup that must be a normal subgroup. 

For groups of order 56, the number of Sylow 7-subgroups n, = | + 7k 
must divide 23 - 7. Thus, (1 + 7k)|23. This yields n, = | or 8. Ifn, = 1, we 
are done. If, = 8, then there are 8 - 6 = 48 elements of order 7. But then 
the remaining eight elements must form a unique Sylow 2-subgroup. So 
again we have a normal subgroup, and therefore G cannot be a simple 
group. 

The nonsimplicity of groups of order 36 may be proved similarly. 
Here is an alternative method, which is sometimes easier. Let H be a 
Sylow 3-subgroup of G. By Theorem 4.4, Chapter 5, there exists a 
homomorphism 9:G—S,, since the index of H in G is 4. Note 
Ker g = {e} implies GG S,, which is not possible since |G| = 36 and 
\S,] = 24. Also Kerg=c,.gxHx~'#G. Thus Kerg is a nontrivial 
normal subgroup of G and so G is not simple. 


(d) Let G be a group of order 108. Show that there exists a normal 
subgroup of order 27 or 9. 


Solution. |G| = 108 = 22 - 33. The number of Sylow 3-subgroups is 1, = 
1 + 3kand (1 + 3k)|22 - 33. Thus, (1 + 3k)|22. This implies k = 0 or 1. If 
k = 0, we have a unique Sylow 3-subgroup that ts then a normal subgroup 
of order 27. If k = 1, we have four Sylow 3-subgroups. Let H and K be two 
distinct Sylow 3-subgroups. Then from the result |HK|=|H||K|/|H N K{| 
(Example 3.1 1(e), Chapter 4), |H/K|= (27 + 27)/|H ON K|s 108. This gives 
27/4<|HN KI, soj\HN K|=9. 

Further, 410 K ( Hand HN K <1) K because subgroups of order p"~' 
are normal in a group of order p” (or, alternatively, HN K=HN 
NK) <1 #H). Now consider MH 1 K). Because HN K is a normal sub- 
group in H as well as in K, 


MHOAK)DH and MHOK)D K. 
Then HK C MHN0 K). Because 


[| + |K|_ 27-27 
IHN K| 9 


|HK|= = 81, 


Sylow theorems 15] 


and MHOK)<G, |MHO K)|= 108. Thus, M(/710K)=G. Hence, 
HO KAG. 


(e) If 771s a normal subgroup ofa finite group G, and if the index of Hin 
G is prime to p, then H contains every Sylow p-subgroup of G. 


Solution. Let |G|= pg, (p,g) = 1. Because the index of H - that is 
|G\/|H| - is prime to p, |H|= p’q,, (p.q,) = 1. By Sylow’s first theorem, 
H (as a group by itself) contains a Sylow p-subgroup K, and, clearly, 
|K| = p”. Because |K| = p”, K is also a Sylow p-subgroup of G. 

Let K, be another Sylow p-subgroup of G. Then by Sylow’s second 
theorem K, = xKx~' for some x € G. Then 


K,=xKx"'C xHx"'CH, since 17 JG. 
Hence, all Sylow p-subgroups of G are contained in H. 


(f) Let G be a group of order pq, where p and q are prime numbers such 
that p > g and q }(p — 1). Then G is cyclic. 


Solution. By the third Sylow theorem the number n, of Sylow p-subgroups 
is equal to | + kp for some positive integer k. Hence, 


pq = X(1 + kp), 
for some positive integer A. This implies p|A, and thus 
qg=A,(1 + kp) 


for some positive integer A,. 

But since p > q, g = A,(1 + kp) is not possible unless k = 0, and then 
we get by Corollary 4.6 that there is only one Sylow p-subgroup. say //, 
that must be normal. 

Further, the number n, of Sylow q-subgroups is equal to 1 + k’q for 
some integer k’, and by the same argument 


p= u,(1 + k’q) 


for some positive integer {/,. 

But since g (p — 1), p=, (1 + k’q) is possible only if k’ = 0, and 
this implies that there is a unique Sylow g-subgroup, say K. K is normal by 
Corollary 4.6. 

It is easy to see that KN H = (e}, and, hence, Ai = Ak Wh E H and 
Vk € K. This implies that if 1 and ) generate the cyclic groups H and K, 
respectively, then the order of the element xy is equal to |//||K| = pq. 
Hence, G is generated by the element xy, and, thus, G is cyclic. 
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Solution. Follows from (f). 
(h) There are only two nonabelian groups of order 8. 


Solution. Consider a nonabelian group G of order 8. Then G contains no 
element of order 8; otherwise G is abelian. If each element of G is of order 
2, then G is abelian. Thus G contains an element a of order 4. Let b€ G 
such that b ¢ [a]. Then G = [a] U [a]b, and b? € [a]. Now 6? =a or 
a? => o(b) is 8, and G is abelian. Thus, we must have b? = e or a?. Because 
[a] is of index 2 in G, {a} <1 G. Hence, b~'ab € [a]. Because 0(b~'ab) = 
o(a), we have b-'ab = a or a>. Hence, b-'ab = a’. 

Thus, we have two nonabelian groups of order 8: 


(i) The group G generated by a and b with defining relations 
a=e, b=e, b'ab=a’. 
(ii) The group G generated by a and b with defining relations 
a=e b?=a*, b'ab=a’. 
The first is the octic group, and the second is the quaternion group. 
That they are nonisomorphic 1s easily checked. 


Incidently, there are three, and only three, abelian groups of order 8 
that are of types (2,2,2), (2,4), (8). 


5 Groups of orders p’, pq 


We recall that there are cyclic groups of every possible order, that two 
cyclic groups of the same order are isomorphic, and that a direct product 
of cyclic groups is abelian. We denote by [x] the cyclic group generated by 
x, and, more generally, by [x,}'] the group generated by x,y. Throughout 
this section p and gq are distinct primes. 

We also recall the following properties of a finite p-group P: 


(i) 2Z(P) is nontrivial. 
(ii) If H isa proper subgroup of P, then M(H) 2 #1. 
(iii) If|P|=p", then every subgroup M of order p”~' is normal. 


In the following, two groups isomorphic to each other are identified. 


5.1 Groups of order p? 


We have already shown (Corollary 4.11, Chapter 5) that a group of order 
p? is abelian. Hence, there are two, and only two, groups of order p?: 
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(i) abelian group of type (p?), and 
(it) abelian group of type (p,p). 


§.2 Groups of order pq, g > p 


Let G be a group of order pq. Then the number of Sylow q-subgroups of G 
is | + Ag, which divides pq. This gives | + Aq = 1, and thus there is 
exactly one Sylow q-subgroup, which, being of order gq, is cyclic. Let [5], 
where 5b? = 1, be this unique Sylow g-subgroup. 

Also the number of Sylow p-subgroups is | + up, which divides pq. 
This gives | + up = 1 or q. 

Case (i). Suppose the number of Sylow p-subgroups in G is |, and let 
this be [a], where a? = 1. Since [a] and [4] are normal in G, and 
[a] M (b] = {e), we have G = [a] X [b]. Because 0(ab) is pg, G = [ab], and, 
thus, G is cyclic. 

Case (ii). The number of Sylow p-subgroups is qg. In this case g = 1 
(mod p). Let [a] be one of the Sylow p-subgroups of G, where a? = 1. Then 
the subgroup [a,b] contains a subgroup [a] of order pand a subgroup [5] of 
order g; thus, |[a,5]| = pq, whence G = [a,b]. Because [b] <1 G, a~'ba = b’ 
for some integer r. Then r # 1 (mod q), for otherwise ba = ab, and G is 
abelian, which in turn implies that the number of Sylow p-subgroups is 1, 
a contradiction. Thus, G is generated by a and b with defining relations 


qe=|=)p4, a~'ba = b’. (1) 
The integer r in (1) is a solution of the congruence equation 
Z’=1 (mod q) (2) 


for 


a~'ba = b' = (a~'ba)(a~'ba) = b*” = a~'b?a = b*” 
=a™'b'a=b" by induction 
= a-'(a~'ba)a = b” = a~*ba? = b” 
=a-’ba’=b” iby induction 
=>b=h"=>;?=1 (mod q). 


Conversely, if r is a solution of the congruence equation (2), then the 
defining relations (1) determine a group consisting of the pq distinct 
elements a/b’, Os i<p-10sj<q-l. 

We note that the solutions of Z° = | (mod gq), Z * | (mod q) arer,r’...., 
r°~' and all yield the same group, because replacing a by a/ as a generator 
of [a] replaces r by r’. 

We conclude that there are at most two groups of order pq: 


(i) The cyclic group of order pq. 
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The nonabelian group generated by a and b with defining rela- 
tions a? = | = b%, a~'ba= b’, r?= 1 (mod gq), r# 1 (mod q), 
provided p divides q — 1. 


Problems 


2. 


Suppose a group G = H X K is a product of groups H and K. 
Prove that G/H X (e) = K. 

Let G be a cyclic group of order 1, and let be a cyclic group of 
order n. Show that G X #H is cyclic if and only if (77,7) = 1. 
Prove that the group of all rational numbers under addition can- 
not be written as a direct sum of two nontrivial subgroups. 
Show that the set of positive integers less than and prime to 21 
form a group with respect to multiplication modulo 21. Prove 
that this group is isomorphic to H X K, where H and K are cyclic 
groups of orders 6 and 2, respectively. 

Prove that if the order of an abelian group is not divisible by a 
square, then the group must be cyclic. 

Show that an abelian group of order p* and of type ( p*',p*,...,p") 
contains p‘ — | elements of order p. 

Show that a finite abelian group that is not cyclic contains a 
subgroup of type (7,7). 

Show that a group of order 1986 is not simple. 

Prove that any group of order 2p must have a normal subgroup of 
order p, where p is prime. 

Show that a group of order p2q, p and gq distinct primes, must 
contain a normal Sylow subgroup and be solvable. 

Prove that a group of order 200 must contain a normal Sylow 
subgroup. 

Let G be a simple group and let 1, be the number of Sylow 
p-subgroups, where p is prime. Show |G] divides (n,)!. Using this 
or otherwise, prove groups of order 48 cannot be simple. 

If the order of a group 1s 42, prove that its Sylow 7-subgroup is 
normal. 

If is a divisor of the order of an abelian group, prove that the 
number of solutions of x” = ein G is a multiple of n. 

If each Sylow subgroup of a finite group G is normal, prove that G 
is the direct product of its Sylow subgroups. 

Prove that any group G of order 12 that is not isomorphic to A, 
contains an element of order 6. Deduce that G has a normal 
Sylow 3-subgroup. 


17. 
18. 


19. 


20. 


21. 


22. 


23. 


24. 
25. 


26. 


27. 
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Let G be a group. Prove that |G/Z(G)| ¥ 77. 

Let A be an abelian subgroup of G. Suppose A <1 G and |G/A| = 
91. Prove that G is solvable. 

Let G be a nonabelian group of order p*, where pis prime. Show 
that Z(G) = G’. What is the order of Z(G)? 

Suppose G is a finite group of order p,p,---p,, where p, are 
distinct primes. Show that G contains a normal Sylow sub- 
group and G is solvable. 

Let G be a finite group with a Sylow p-subgroup P. Let N = MP). 
Show that any subgroup of G that contains N (N included) ts 
equal to its own normalizer. 

Let G be a finite group, and let P be a Sylow p-subgroup of G. Let 
N ‘AG. Show that NN P is a Sylow p-subgroup of N. Also, show 
that NP/N is a Sylow p-subgroup of G/N. 

Prove that a finite group G is nilpotent if and only if it is the direct 
product of its Sylow subgroups. [//int: First show that if 77 < G, 
then M(H) # H. Then use Problem 21 to obtain that if H is a 
Sylow p-subgroup, then H <4 G.] 

Prove that A, is the first nonabelian simple group. 

Suppose P is a Sylow p-subgroup of a group G. Let H < G such 
that |H| = p”, m > 0. Show that HN MP)= HNP. 

Let P be a Sylow p-subgroup of a finite group G. If M isa 
normal p-subgroup, show that M ¢C P. Show also that G con- 
tains a normal p-subgroup H that contains all normal p- 
subgroups of G. 

Let G be a finite group such that G = HK, where H <G, K <G. 
For a prime p, if P is a Sylow p-subgroup of G, show that 
x” 'PxK is a Sylow p-subgroup of K for some xeH. 


PART III 


Rings and modules 


CHAPTER 9 


Rings 


] Definition and examples 


Definition. By a ring we mean a nonempty set R with two binary opera- 
tions + and -, called addition and multiplication (also called product), 
respectively, such that 


(i) (R,+) is an additive abelian group. 
(ii) (R,-) is a multiplicative semigroup. 
(iii) Multiplication is distributive (on both sides) over addition; that is, 
for all a,b,c, € R, 


a:‘(b+c)=a:‘bta:c, (a+ b):-c=a:cectbre. 


(The two distributive laws are respectively called the left distributive law 
and the right distributive law.) 


We usually write ab instead of a - b. The identity of the additive abe- 
lian group is called zero element of the ring R and is unique. We denote the 
zero element ofa ring by 0. Further, the additive inverse of an element a of 
the additive abelian group (R,+) shall, as usual, be denoted by — a. Thus, 
in a ring R, 

a+(—a)=0 for allae R. 


Further, if a,b € R, we denote a + (—b) by a — b. 
The most familiar examples of rings are 


Z: the ring of all integers, 
Q: the ring of all rational numbers, 
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R: the ring of all real numbers, 
C: the ring of all complex numbers. 
Other interesting examples are 


C[0,1J: the ring of all continuous functions from the interval 
[0,1] to R, 

R*: the ring of all functions from a nonempty set X to the real 
numbers R. 


The addition in each of the rings is the usual addition of mappings; that is, 


(f + gXx) = f(x) + g(x). 
We define multiplication by 


(fghx) = f(x)g(x). 


Another example of a ring is an additive abelian group A in which 
multiplication is defined by 


ab =0 for all a,b € A 


The ring A so obtained is called a trivial ring. We define a trivial ring as one 
in which the product (or multiplication) of any two elements is 0. 

An example of a finite ring — that is, a ring with finite number of 
elements - is Z, (or Z/(n)): the ring of integers modulo n. The elements of 
Z,, are the residue classes mod n; that is, Z, = (0,1,...,.(m — 1)}. Addition 
and multiplication in Z, are defined as follows: 


X+ty=xty, xy = Xy. 
(See Example 3.1! in Chapter 4.) To illustrate, we give the addition and 
multiplication tables of Z,. 


O;O 1 2 3 010 9 0 0 
ly1 23 0 1jO 1 2 3 
2;2 301 2;0 20 2 
313 0 1 2 310 3 2 1 


A commutative ring isa ring R in which multiplication is commutative; 
that is, ab = ba for all a,b € R. If a ring is not commutative it is called 
noncommutative. 

A ring with unity is a ring R in which the multiplicative semigroup 
(R,:) has an identity element; that is, there exists e€ R such that 
ae = a= ea for all a€ R. The element e is called unity or the identity 
element of R. Generally, the unity or identity element is denoted by |. 
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2 Elementary properties of rings 


2.1 Theorem. Let R be a ring. Then for all a,b,c € R, 
(i) aA=0=0a. 
(ii) a(—b) = —(ab) = (—a)b. 
(iti) a(b—c)=ab— ac, (a— b)c = ac — be. 
Proof. (i) 
a0 = a(0 + 0) = a0 + a0. 
Thus, 
aQ + (—(a0)) = a0. 
So, 
0 = a0. 
Similarly, 
0 = 0a. 
(ti) 
0 = a0 = a(b + (—b)) = ab + a(—b). 
Thus, 
— (ab) = a(— b). 
Similarly, 
— (ab) = (—a)b. 
(iii) 
a(b— c) = a(b + (— c)) = ab + a(—c) = ab — ac. 
Similarly, 
(a- b)c=ac—be. O 


Let a, ,a2,...,a, be a sequence of elements of a ring R. We define their 
product inductively: 
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We then have the following result, which can be directly proved by induc- 
tion on m and n. (See Theorem 4.2 of Chapter 1.) 


2.2 Theorem 
m n m+n 
(Ti a)(T] ants) = I] Qj; 
= j= i= 


for all a, ,Qz,..,Am4, ina ring R. 


This result, called the generalized associative law for products, essen- 
tially asserts that we can insert parentheses in any manner in our product 
without changing its value. The uniquely determined product of the 
SEQUENCE Q, ,@>,...,a, is denoted a,a,°°-a,. In the same way we can de- 
fine uniquely the sum of the sequence 4a,,d,,...,.a, and denote it 
a, +a,+ °*: +a,. Also, by using induction on m and n, we can easily 
prove the generalized distributive law given in 


2.3 Theorem 


(a, +a,+ ++ +a,,)(b, + b, + -+> + b,) = a,b, + a,b, + ++ +a,5, 
+ a,b, + +++ + ab, + +°* + yb, + Agby + +++ + and, 


for all a,,....a,, and b,,...,b, in a ring R. 


Ifa € Rand mis a positive integer, we write 
m times m times 

ee aN a en 

a“=a:a-"'a and ma=atat::: +a. 
If mis a negative integer, then ma stands for 
—m times 

aN, 
(—a) +(— a) + -*- +(—a). 


If a has an inverse a~! € R, then a~™” denotes the element (a~')”. If 
1 € R, then we define a® = 1. Also we define Oa to be the zero element of 
the ring. 


2.4 Theorem. (a) For all positive integers m and n and for alla ina 
ring R, the following hold: 
(i) aa" =a™t". 
(ii) (a"y=a™. 
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(b) For all integers m and nand for alla,bina ring R the following hold: 
(iii) ma+na=(m+n)a. 


(iv) m(na) = (mn)a. 
(vy) (ma\Xnb) = (mn\ab) = (naXmb). 


The theorem may be proved by using induction on m and n. It is left as 
an exercise for the reader. 


3 Types of rings 


Definition. A ring R whose nonzero elements form a group under multipli- 
cation is called a division ring. If, in addition, R is commutative, then R is 


called a field. 


The rational numbers Q, the real numbers R, and the complex num- 


bers C are examples of fields. 
An example of a division ring that is not a field is the ring of real 


quaternions. 
Let H be the set of 2 X 2 matrices of the form 


|-é 3h 


where a and b are complex numbers, and 4 and 5, respectively, denote 
their complex conjugates. Then H is anoncommutative division ring. For 


if 


is a nonzero matrix, then its determinant 6 = aa + bb ¥ 0, and, hence, 


a —b 
yinl eo 
b a 
é.O«Oéé 


Definition. A ring R is called an integral domain ifxy = 0, x E R, yE R, 
implies x =0 ory=0. 


The ring of integers Z is an integral domain. Every field (therefore 
every division ring) is an integral domain. (Let F be a field, and let 
xy =0, x,y € F. If x #0, then x7! exists, so x~'(xy) = 0. Thus, y = 0.) 
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However, an integral domain need not be a division ring or a field. (For 
example, the ring of integers Z is neither.) 


Definition. An element x in a ring R is called a right zero divisor if there 
exists a nonzero element y € R such that yx = 0. 


One can similarly define left zero divisor. An element is called a zero 
divisor if it is both a right and a left zero divisor. 

We may note that a ring R is an integral domain if and only if it has no 
right (or left) zero divisors except 0. 


3.1 Some important examples of rings 


(a) The ring of matrices 


Rudiments of matrices over any field have been given in Chapter 3. 
Matrices with entries in a ring R (i.e., matrices over R) and their addition 
and multiplication can be defined in exactly the same manner. 

Let R bea ring, and let R, be the set of all n X 1 matrices over R. Then 
R,, with the usual addition and multiplication of matrices is called the ring 
ofn X n matrices over R. lf n> 1 and Ris nota trivial ring, then R,, is not 
commutative. For if 


then ab # ba. It also follows that if n> 1, R, contains nonzero zero 
divisors; that 1s, R, is not an integral domain. 

Suppose K has unity 1. We denote by e,, the matrices in R,, whose (i,/) 
entry is | and whose other entries are zero. The e,,’s, | = i,j S 1, are called 
matrix units. From the definition of multiplication of matrices, it follows 
that e,e,, = 0 if j # k, and e,e, = e,; that is, 


C50 = Onis 


where 6, is the Kronecker delta. Also, if A = (a,;) € R,, then A can be 
uniquely expressed as a linear combination of e,’s over R; that is, 
A= I aye, ay R. 
Isiyan 
This representation of matrices in terms of matrix units is often a conve- 
nient tool in certain problems involving algebraic manipulations. 
Let S be the set of n X n matrices, all of whose entries below the main 
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diagonal are zero; that is, let S consist of matrices 


Qa) Qi? eee eee Qin 
a2 eee een Qon, 
: a, ER. 
0 0 eve eee Aan 


Then Sisa ring with the usual addition and multiplication of matrices and 
is called the ring of upper triangular matrices. Similarly, we have the ring 
of lower triangular matrices. 


(b) The ring of polynomials 


Let R be a ring. A polynomial with coefficients in R is a formal expression 
of the form 


Ay + A,X + agxX? + ++ +.,.x", a; E R. 


Two polynomials a) + a,x + +++ +.4,,x”" and bp + bx + °** +5,x" 
are called equal if m = n and a; = 5; for all i. Polynomials may be added 
and multiplied to form a ring in the following manner. Let 


S(X) = dg + a,x + a,x? + + +4,,x", 
B(x) = by + b,x + byx? + + +5,x". 
Then 
S(x) + g(x) = (ay + by) + (a, + b,)x + (a, + b,)x? + ane 
I(X)B(X) = Cy + CX H+ Cae Ht HO TT, 
where 


c= Y ab, OSismt+n. 
j+kei 


Alternatively, one may look upon 

Ag tayxt+ +++ +a,,x", a,€ R, 
as a sequence 

(Ap A; ,.--54,,50,0,...), a,€ R; 


that is, a function /;N— R, where f(i)=a,_,, | Sis m+, and 
Si) =0,i>m+1. 
Let bp + b,x + +++ + 5,x", b,E R, be another polynomial, and let 
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g:N—R be the corresponding function, so that g(i)=5,_,, |sis 
n+ 1 and g(i)=0, i>n+1. The sum f+ g of the functions fand g is 
defined to be the usual sum of functions, namely, 


(f+ ght) =f) + gi), 
or, equivalently, 


(dg + ayx t+ -+* +a,,X™) + (by + Ox + +++ + 5,X") 
= (dy + by) + (a, + b,)x + (a, + b,x? + ee 


The product of two functions fand g is defined to be the function /, where 
hi)= > fide,  ieN, 
Jrkawi+1 
or, equivalently, 


(dg + a,x +--+ +a,,X"\ bo + bx + +++ +5,x") 
= Cy OX Cx, 
where 
C= DY aby. 
jtkei 
It is now straightforward to verify that the set of polynomials (i.e., se- 
quences with finitely many nonzero entries) forms a ring called the poly- 
nomial ring over R, denoted by R[x]. 
Further, the mapping a — (a,0.0,...) of the nng R into the ring R[x] 
preserves binary operations; that is, if b — (6,0,0,...), then 


a+b—(a+t 6,0,0.,...) = (a,0,0,...) + (6,0,0,...), 
ab — (ab,0,0,...) = (a,0,0,...)(6,0,0,...), 


and the mapping is also 1-1. Thus, R[x] contains a carbon copy of R. By 
identifying the element a € R with the sequence (a,0,0,...)€ R[x], we say 
that R is a subring of R[x]. 

In what follows we assume | € R, and we denote by x the sequence 
(0,1,0,...). Then x? = (0,0,1,0,...), x? = (0,0,0,1,0,...), and so on. Consider 
a sequence (dap,4,,...,a,,0,...) with finitely many nonzero entnes. We find 


(Ap Qj y---sA,,0,...) = (Ag .0....) + (0,4, ,0....) + --> + (0,...,0,a, ,0....) 
=A taxt-: +a,x", 


which yields the familiar expression of a polynomial in x. We call x an 
indeterminate and refer to the polynomials in R[x] as polynomials in one 
indeterminate over R (sometimes we also call them polynomials in a 
vaniable x). 


Types of rings 167 


(c) Ring of endomorphisms of an abelian group 


Let M be an additive abelian group. Let End(A/) [or Hom(, A/)] denote, 
as usual, the set of endomorphisms of the group M into itself. We define 
addition and multiplication in End(A/) as follows: 


(i) (f+ Xx) =S(x) + g(r), 
(ii) (Sex) = f(g (x), 
for all fg € End(/) and x€ M. 
We show that the set End(/) together with the above binary opera- 
tions of addition and multiplication forms a ring. First we show that if 


fg€ End(M), then f+ g€ End(A/) and fe € End(Af). Let x,y © M. 
Then 


(f+ ext yaf(xty)+e(xty) — [by (i)] 
= (f(x) + f(y] + [e@) + 8) 
(because fand g are homomorphisms) 
= [f(x) + e600) + (£0) + 209) 
(because M is abelian) 
= (f+ gXx) + (s+ ey) [by (i)). 
This shows (f+ g) € End(). 
Further 


(sex + y) =f(g(x+y)) [by (ii)] 
= f{g(x)+2(y)) — (because g is a homomorphism) 
= f(g(x)) + f(2())) (because fis a homomorphism) 
= (feXx)+(fexXy) — [by (i)], 


which shows that fg € End(A/). Also it is easily shown that the two map- 
pings 
0:x-0 (zero endomorphism of 4) 
and 
Ix->x (identity endomorphism of M/) 
of M into M are also endomorphisms of M. 
If fE End(M), define —/fto be the mapping given by (— / .x) = —f(x) 
for all x € M. Then —f€ End(M) because, for any x,y € M, 
(—f x + y) = — (Se + y)) = — (£0) +f) 
= — f(x) — £09) = (SCO + (S10). 
Obviously, f+ 0 = fand f+ (—f) = 0 for all fe End(M). 
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Furthermore, along the same lines, it can be easily shown that for all 
fg,h © End(M), 


(f+ g)th=f+ (gt h), 
Sf+g=atys, 
(fe)h = f(gh), 

(f+ gh =fh + gh, 


and 


K(gth=fetfh. 
Hence (End(A/),+,-) 1s a ring. 


(d) Boolean ring 


Let & be the set of all subsets of a nonempty set A [called the power set of A 
and denoted by ¥(A)}. 
If a,b € B we define 


a+b=(aUb)—(anb), 
ab=anb. 


Then (%,+, « ) is a commutative ring with identity (which is the whole set 
A). The zero element of this ring is the empty set. Some of the important 
properties of the ring R are 


(i) a? =a, and 
(1) 2a=0, 


for allace 8. 

A ring R is called a Boolean ring if x? = x for all x € R. It can be easily 
shown that x? = x for all x € R implies 2x = 0. It is interesting to point 
out that any Boolean ring is known to be a subring of the ring of all subsets 
of a set. 


4 Subrings and characteristic of a ring 


Definition. Let (R,+,:) be a ring, and let S be a nonempty subset of R. 
Then S is called a subring if(S.+, -) is itself a ring. 


We note that addition and multiplication of elements of S are to 
coincide with addition and multiplication of these elements considered as 
elements of the larger ring R. Similarly, we can define a subdivision ring of 
a division ring and a subfield of a field. 
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Every ring R has two trivial subrings, (0) or simply 0 and R. Let R bea 
ring (with or without unity). A subring of R may have unity different from 
the unity of R [see Problem 10(b)]. The following result is frequently 
useful: 


4,1 Theorem. A nonempty subset S of a ring R is a subring ifand only 
if for alla,b E€ S we havea— bE Sandabe S. 


Proof (If part). The condition a ~— 6 € Sforalla,b € Simplies(S,+) is an 
additive subgroup of the group (R,+). The condition ab € S for all 
a,b € S implies (S, +) is a semigroup. Because the two distributive laws 
hold in R, they also hold in S. Hence, (S,+.,- )isasubring of (R,+,-). The 
converse follows from the definition of a subring. O 


Definition. Let R be a ring. Then the set Z(R) = (a € R\xa = ax for all 
x € R} is called the center of the ring R. 


4.2 Theorem. The center of a ring is a subring. 


Proof. Let R be a ring. Write Z(R) = S. Because 0 € S, then S # @. Let 
a,b Ee S, x € R. Then 


(a — b)x = ax — bx = xa — xb = x(a — b). 
Thus, a— bE S. Further, 

(ab)x = a(bx) = a(xb) = (ax)b = (xa)b = x(ab). 
Hence, ab € S. Therefore, by Theorem 4.1, Sisasubring. O 


Definition. Let S be a subset of a ring R. Then the smallest subring of R 
containing S is called the subring generated by S. 


Because the intersection of a family of subrings is again a subring, it 
follows that the subring generated by a subset S of R is the intersection of 
all subrings of R containing S. 

The subring generated by the empty set is clearly (0), and the subring 
generated by a single element a in R consists of all elements of the form 
na+n,a? + --» +n,a*, n,€ Z, and k a positive integer. 

We now define the characteristic of a ring. 


Definition. /f there exists a positive integer n such that na = 0 for each 
element a of a ring R, the smallest such positive integer is called the 
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characteristic of R. [f no such positive integer exists, R is said to have 
characteristic zero. The characteristic of R is denoted char R. 


For example, the ring of integers has characteristic zero. The ring Z/(n) 
of integers modulo n has characteristic 71. 


4.3 Theorem. Let F be a field. Then the characteristic of F is either 0 
or a prime number p. 


Proof. Let n # 0 be the characteristic of F. If e is the identity of F, then 
ne = 0. If possible, let n = n,n,, n, <n, n, <n. Then (1,n,)e = 0 yields 
(n,eX(n,e) = 0. But since F is a field, either n,e = 0 or nye = 0. But then 
either n,ea = 0 (i.e., n,a = 0) or n,ea = 0 (1.e., n,a = 0) for all aE R. 
Thus, the characteristic of F is = max (n,,n,) <n, a contradiction. This 
proves nis prime. O 


Definition. An element a ina ring R is called nilpotent if there exists a 
positive integer n such that a" = 0. 


0 is always nilpotent. The element (8 3) in a 2 X 2 matrix ring is also 
nilpotent because 


(3 o) ~(0 o} 


If R is an integral domain, then R does not possess any nonzero nilpotent 
elements, because for 0#a€R, we have a"=0=>a:-a""'=0, so 
a"—' = (0. Continuing like this we obtain a = 0. 


Definition. An element e ina ring R is called idempotent if e = e?. 
Clearly, 0 and | (if R has unity) are idempotent elements. 


Definition. Let A be an arbitrary ring, and let F bea field. Then we say that 
A is an algebra over F if there exists a mapping (a,x) > ax ofF X A into A 
such that 


(i) (a+ B)x=ax t+ Bx, a,pE F,xE A; 
(ii) a(xt+y)=axtay, ae F,x,yEaA; 
(iit) (aB)x = a(Bx), a,pE F,xE A; 

(iv) lx =x, LE F,xEA; 
(vy) a(xy) =(ax)y= x(ay), aé F,x,yEaA. 


The element ax is called the scalar multiplication of a and x. 
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Any field F can be regarded as an algebra over itself by defining scalar 
multiplication as multiplication of elements in F. More generally, if R isa 
ring containing a subring F C Z(R) such that F is a field, then R can be 
regarded as an algebra over F. Thus, the field of real numbers R can be 
regarded as an algebra over the field of rational numbers Q. Also, the 
polynomial ring Q[x] is an algebra over Q. 

We now introduce the concepts of direct product and direct sum of a 
family of rings. These will be discussed in somewhat greater detail in a 
later chapter. Because of their many-sided importance and also for our 
present need to construct some interesting examples, we define them 
here. 

Let R,, i= 1,2,..., be a family of rings. We construct from this family a 
new ring as follows. Consider the set 


R = ((a,,Q3,...)|a,; © R;) 


of sequences (a,,@,,...), which we write as (a,).We define binary opera- 
tions on R in the following natural way: 


(a, ,@>,...) + (b, ,b5....) = (a, + b,,a, + by,...) 
and 
(a, ,@,...) (b,,b5...) = (a,b, ,a2b,,...). 


It is then easy to verify that R is a ring. This ring R is called the direct 
product of the rings R,, i = 1,2,..., and often designated by I1,R,. The rings 
R; may be called the component rings of this direct product. Moreover, if 
(a, ,Ap,...,4;,...) © R, it is convenient to call a; the ith component of this 
element and to refer to addition and multiplication in R as component- 
wise addition and multiplication. 

The zero element of R is the element whose ith component is the zero 
of R, for all i = 1,2,.... Also, R will have unity ifevery R; has unity. The set 
R? of elements of the form (0,0,...,0,a;,0...) of R forms a subring of R. 

Consider a subset S of R such if (a;) € S, then all but a finite number of 
the components of (a,) are zero. It is easy to verify that Sis a subring of R. 
This ring S is called the direct sum of the family R;, i= 1,2,..., and is 
denoted by © £,R,. If the family of rings &; is infinite, then © , R; will 
not have unity even though each R; may have unity. 

Note that if the family of rings R, is finite, then direct product and direct 
sum coincide. 


4.4 Examples 


(a) Let R be a ring and let x € R. If there exists a unique a € R such that 
xa =x, then ax = x. 


172 Rings 
In particular, if R has a unique right unity e, then e is the unity of R. 


Solution. Clearly, x(a + ax — x) =x. So a+ax-—x=a and, hence, 
ax =x. 


(b) Let x be a nonzero element ofa ring R with |. If there exists a unique 
y€ R such that xyx = x, then xy = | = yx; that is, x is invertible in R. 


Solution. Let xr = 0, r€é R. Then x(y + r)x = x and, hence, y+ r= y. 
Thus, x7 = 0 implies r= 0. Now xyx = x gives x(yx — 1) = 0. Thus, 
yx — 1 = 0. Similarly, xy — 1 = 0, which completes the solution. 


(c) Let R bea ring with 1. Ifan element a of R has more than one right 
inverse, then it has infinitely many right inverses. 


Solution. Let A = {a’ € Riaa’ = 1}. By hypothesis A has more than one 
element. It follows then that @ does not have a two-sided inverse. Fix 
@) © A and consider the mapping a’ > a’a — | + ap of A to A. It can be 
easily checked that this mapping is |-1 but not onto (a does not have a 
preimage). Hence, A must be an infinite set because every 1-1 mapping of 
a finite set into itself is an onto mapping (Theorem 3.3, Chapter 1). 


(d) Let R be a ring with more than one element. Ifax = bhasasolution 
for all nonzero a € R and for all b € R, then R is a division ring. 


Solution. First we show R is an integral domain. If possible, let a #0, b #0, 
and ab = 0. Then abx = 0 forall x € R. Now, by hypothesis, forallc € R, 
bx = c has a solution. Thus, ac = 0 for every c € R. Because, for some 
c€ R,ac = a, we get a = 0, acontradiction. Hence, R must be an integral 
domain. 

Next let x = e€ R be a solution of ax = a, a¥0. Then e* 0 and 
a(e — e?) = 0. But this implies e — e? = 0. Thus, e is a nonzero idempo- 
tent. 

We assert that e is an identity element in R. Now for all x € R, we have 
(xe — x)e = xe? — xe = 0, so xe — x = 0. Similarly, e(ex — x) = 0 yields 
ex = x. This proves our assertion. Finally, if0 # a € R, then there exists 
bER such that ab =e. Also, (ba — e)b = bab —- b=0; thus, ba =e. 
This proves that each nonzero element a € R has an inverse. 


(e) Let R be a ring such that x° = x for all x € R. Then R is commuta- 
tive. 
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Solution. x? = x for all x € R implies (x + x)? = (x + x) for all x E R. 
Thus, 6x = 0 for all x € R. Also, (x? — x)? = x? — x implies 3x? = 3x 
after simplifications. 

Consider S = (3x|x € R). It can be easily checked that Sis a subring of 
R and, for yE S, 


y? = (3x)? = 9x? = 6x? + 3x? = 3x? = 3x=y, 


Thus, y?= y for all y€ S implies S is commutative, so (3x\3y) = 
(3y)(3x), that is, 9xy =9yx, which implies 3xy = 3yx. Now (x + y)? = 
x + yimplies 


xy? + x2yp + xypx + px? + yxy + y*x =0, (1) 
and (x — y)} = x — y implies 

xy? — x*y — xyx — px? + pxy + y*x = 0. (2) 
By adding (1) and (2) we get 

2xy? + 2yxy + 2y*x =0. 
Multiply the last equation by y on right and by y on left: 


2xy + 2yxy? + 2y*xy = 0. (3) 
2yxy? + 2y*xy + 2px = 0. (4) 


Subtract (3) from (4) to get 
2xy = 2yx. 


Because 3xy = 3yx, we have xy = yx for all x,y € R. Hence, R is com- 
mutative. 


Problems 


1. Let S=C{[O0,1] be the set of real-valued continuous functions 
defined on the closed interval (0,1], where we define {+ g and /z, 
as usual, by (f+ g) (x) = f(x) + g(x) and (fg) (x) = f(x)g(x). Let 
O and | be the constant functions 0 and 1, respectively. Show that 


(a) (S,+,-) is a commutative ring with unity. 

(b) Shas nonzero zero divisors. 

(c) Shas no idempotents #0,1. 

(d) Leta e€[0,1]. Then the set 7 = ({ feE S|f(a) = 0) isa subring 
such that fg, ef € T for all fE TandgeS. 
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Let R be an integral domain and a,b € R. If a” = b”, a" = b", 

and (m,n) = 1, show that a = b. 

(a) Show that the following are subrings of C. 

(i) A={(a+b V—l]a,b € Z). . 
(ii) B=(a+b V—3jeither a,b € Z or both a,b are halves 
of odd integers). 

The set A is called the ring of Gaussian integers. 

(b) Let ebe an idempotent ina ring R. Show that the set eRe = 
{eaela € R} is a subring of R with unity e. 

Show that an integral domain contains no idempotents except 0 

and | (if 1 exists). 

(a) Determine the idempotents, nilpotent elements, and inver- 
tible elements of the following rings: 

(i) Z/(4) (1) Z/(20) 

(b) Showthat the set U(R) of units ofa ring R with unity formsa 
multiplicative group (cf. Problem 18(c)). 

(c) Prove that an element x € Z/(7) is invertible if and only if 
(x,n) = 1. Show also that if(x,n) = 1, then x®” = 1 (mod n), 
where (7) is Euler's function (this is called the Euler- 
Fermat theorem). 

Let S be the set of 2 X 2 matrices over Z of the form (g 2). Show 

that 

(a) Sisaring. 


k 

(b) (< *) = (< a where * denotes some integer. 

Also find the idempotents and nilpotent elements of S. Show that 

nilpotent elements form a subring. 

Ifaand bare nilpotent elements ofa commutative ring, show that 

a + bisalso nilpotent. Give an example to show that this may fail 

if the ring is not commutative. 

Prove that the following statements for a ring R are equivalent: 

(a) Rhas no nonzero nilpotent elements. 

(b) Ifae R such that a* = 0, then a= 0. 

Furthermore, show that under any of the conditions (a) or (b) 

all idempotents are central. 

Show that the characteristic of an integral domain is either 0 ora 

prime number. 

(a) Ifaring R with | hascharacteristic 0, show that R contains a 
subring that is in 1-1] correspondence with Z (this subring 1s 

(b) Let S=(0,2,4,6,8). Show that S is a subring of Z,o with 
unity different from the unity of Zyo. 
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Show that a finite integral domain is a division ring. 

If x* = x for all x in a ring R, show that R is commutative. 

Let a be an element in a ring R with unity. If there exists an 
invertible element u € R such that aua = a, show that ab = | for 
some b € R implies ba = 1. 

Let R be a ring with unity such that for each a € R there exists 
x € R such that a2x = a. Prove the following: 


(a) Rhas no nonzero nilpotent elements. 

(b) axa — ais nilpotent and, hence, axa = a. 

(c) ax=xa. 

(d) axand xa are idempotents in the center of R. 

(e) There exists y€ Rsuch that a?y = a, y*a = y,anday= ya. 

(f) aua=a, where u= 1 + y— ay is invertible, and y€ R is 
chosen as in (e). 

Let R be a ring with left identity e. Suppose, for each nonzero 

element ain R, there exists bin Rsuch that ab = e. Prove that Ris 

a division ring. 


In Problems 16-23, by a right (left) guasi-regular element in a ring R, 
we mean an element a € R such that the equation 


a+x-—ax=0 (a+x-—xa=0) 


has a solution x= DE R. 
An element a € R is called quasi-regular if it is both right and left 
quasi-regular in R. 


16. 


17. 


18. 


20. 


21. 


Show that ifa € Ris nilpotent, then a is quasi-regular. Further, if 

R has a unity, then | — a 1s invertible. 

Let a,b € R such that ab is nilpotent. Show that ba is nilpotent. 

Show also that if ab is nght quasi-regular, then ba is also night 

quasi-regular. 

Let R be a ring. Define a circle composition ° in R by ae b= 

a+b—ab,a,bE R. 

(a) Show that (R.-) is a semigroup with identity. 

(b) If Sis the set ofall quasi-regular elements in R, then (S,°) isa 
group. 

(c) Suppose R has unity. Let (U(R),-) denote the group of units 
(i.e., invertible elements) of R. Show that (U(R),:) = (S,°). 

Let a,b € R such that ba = b and a is nght quasi-regular. Show 

that b = 0. 

Let a€ R and let a” be nght quasi-regular, where n is some 

positive integer. Show that a is night quasi-regular. 

Let e be an idempotent in aring R. If eis right quasi-regular, show 

that e = 0. 
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22. 


23. 


24. 


25. 
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Call an element a in a ring R (von Neumann) regular if there 

exists an element 5 € R such that aba = a (for example, all in- 

vertible elements and idempotent elements are von Neumann 

regular). Let R be a ring with | such that each regular element 

different from | is right quasi-regular. Show the following: 

(a) 0 #€a€ Ris invertible if and only if a is regular. 

(b) 0,1 #aER is invertible if and only if a is nght quasi- 
regular. 

(c) Rhas no nonzero nilpotent elements. 

(d) Rhas no idempotents different from 0 and |. 

(e) Either 2 = 0 or 2 is invertible in R. 

(f) The set of invertible elements in R along with 0 forms a 
division ring. 

Let R be a ring with more than one element. Show that R is a 

division ring if and only if all but one element are nght quasi- 

regular. 

Let R be the direct sum of a family of rings R,, i = 1,2,...,2. Show 

that the invertible elements of R are the elements (1, ,u>,...,2,), U; 

an invertible element of R,. In other words, show that 


U(R) = U(R,) x U(R,) x --- x U(R,), 

where U(R), U(R;) denote, as usual, the group of units of the rings 
R and R,, respectively. 

Let R be the direct sum of a family of rings R, ,R3,....R,, n> I. 
Show that R cannot be an integral domain. Also show that a = 
(a, ,@>,....€,) © Ris nilpotent ifand only if each a,, i = 1,2,...,m, 1s 
nilpotent. 


Additional examples of rings 


(a) Ring of formal power series 
Let R be a ring, and let 


R{[x]] = {(@,) = (49,4, ,@2,...)]a, € R). 


Define addition and multiplication in R[[x]] as in R[x]; that is, 


(a,) + (b,) = (a; + 5,), 
(a,b,) = (c,), 


where c, = 2)4,—; 2,5,. Then (R[[x]], + ,:) becomes a ring, called the ring 
of formal power series over R. As in a polynomial ring, the elements of R 
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themselves may be considered to be elements of R[[.x]]. It is an interesting 
exercise to show that if R is an integral domain, then R[[x)]} is also an 
integral domain. Further, the elements of R[[x]] are generally wntten as 
>a a,x", 


(b) Ring of formal Laurent series 
Let R be a ring, and let 


R(X) = ((...50,0, a5 Ag+ 4 -++sp sy 52,...)|@; & R). 


In other words, the elements of R(x) are infinite sequences, with the 
understanding that at most a finite number of the components a,, with ia 
negative integer, are different from zero. If addition and multiplication of 
elements of R(x) are defined precisely as in R[x) or in R{[x)], then R¢x) 
becomes a ring containing R[[x)] as a subring. R(x) 1s called the ring of 
formal Laurent series over R. Straightforward computations yield that 
Rx) is a field if R is a field. 


(c) Group algebra 


Let G be a group, R a ring with unity, and R[G] the set of mappings 
at f(a) of G into R such that f(a) =0 for all but a finite number of 
elements of G. Define addition, multiplication, 0, and | in R[G) by 


(f+ g(a) = f(a) + g(a), 
(/gXa) = = S(b)g(c), 
O(a) = 0, 
l(fe)=1, Ma)=0 ifaxe. 


Show that R[G] is a ring called the group algebra of G over R. Show that if 
R is commutative, then R is contained in the center of R[G]. As an 
exercise, write the multiplication table and list some properties for the 
group algebra F[G], where F = Z/(2) and G is a group of order 3. 


(d) Quaternions over any field 


Readers not familiar with the notion of a basis of a vector space may skip 
this. Let Q be a vector space of dimension 4 over a field K. Let (1,i, j,k} bea 
basis of Q over K. We shall make Q into an algebra over K by suitably 
defining multiplication of these basis elements in Q as follows: 


(Recall that the product of any element in the left column by any element 
in the top row (in this order) is to be found at the intersection of the 
respective row and column.) We extend multiplication to the whole set Q 
in the obvious manner. This yields that Q is an algebra over K, called the 
algebra of quaternions over K. 

If K = R, then by taking 


(5 thin(OT -r}in(S, ote (er %') 


we see that Qis the same as the ring of real quaternions H, discussed earlier 
in this chapter. 
As an exercise show that if K = C, then Q is not a division ring. 


CHAPTER 10 


Ideals and homomorphisms 


The concept of an “ideal” in a ring is analogous to the concept of a 
‘normal subgroup” in a group. Rings modulo an ideal are constructed in 
the same canonical fashion as groups modulo a normal subgroup. The 
role of an ideal in a ‘“Shomomorphism between rings”’ is similar to the role 
of a normal subgroup in a “homomorphism between groups.” Theorems 
proved in this chapter on the direct sum of ideals in a ring and on homo- 
morphisms between rings are parallel to the corresponding theorems for 
groups proved in Chapters 5 and 8. 


1 Ideals 


Definition. A nonempty subset S of a ring R is called an ideal of R if 


(i) abe Simpliesa—beES. 
(ii) a€ Sandre Rimply are Sand rae S. 


Definition. A nonempty subset S of a ring R is called a right (left) ideal if 


(i) abe Simpliesa—beES. 
(ii) are S(ra€ES) forallac Sandre R. 


Clearly, a right ideal or a left ideal is a subring of R, and every ideal is 
both nght and left, so an ideal is sometimes called a two-sided ideal. 
Trivially, in a commutative ring every right ideal or left ideal is two-sided. 

In every ring R, (0) and R are ideals, called trivial ideals. 
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1.1 Examples of ideals 


(a) In the ring of integers Z every subring is an ideal. To see this, let /be a 
subring of Z and a € I, re Z. Then 


r times 
TIN, 
ar=atat-:::+a, ifr > 0, 
=0 ifr=0, 
—r times 
re nn 
=—Q-— Qi @: ifr<0. 


So in every case ar € I. Hence, J is an ideal of Z. 

(b) The right as well as left ideals ofa division ring are trivial ideals only. 

Indeed, ifa nght or a left ideal J ofa ring R containsa unit of R, then /is 
the whole ring, because a unit vu € /implies uu~! € J; that is, 1 € J, so for 
all rE R, we have r = rl € J. Hence, / = R. 

(c) Let R be a ring and a € R. Then 


aR = ({ax|x € R) 
is a right ideal of R, and 
Ra = {xa\|x € R) 


is a left ideal of R. If R is commutative, then aR is an ideal of R. 

We note that a need not belong to aR. A sufficient condition for a to be 
in aR is that | € R;in this case aR is the smallest right ideal containing a. 

(d) Let R be then X n matrix ring over a field F. Forany 1 si < let A, 
(or B,) be the set of matrices in R having all rows (columns), except 
possibly the ith, zero. Then 4, is a right ideal, and B, isa left ideal in R. We 
later show that R has no nontnivial ideals. 

(e) Let R be the ring of 2 X 2 upper triangular matrices over a field F. 
Then the subset 


={[0 3] 


is an ideal in R. 

(f) Let R be the ring of all functions from the closed interval [0,1] to the 
field of real numbers. Let c € [0,1] and J = (fE R|f(c) = 0). Then /is an 
ideal of R. 

(g) Let R = F,, the 2 X 2 matrix ring over a field F. Let S = [5 £] be the 
set of upper triangular matrices over F. Then S is a subring of R. If 
I = [8 ], then /is an ideal of S, but / is neither a right ideal nor a left ideal 


of R. 
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Next we prove an important result about ideals in the matrix ring R,,. If 
A isan ideal in the ring R, then it can be easily shown that the ning A, of all 
nn matrices with entries from A is an ideal in R,. The following 
theorem states that the converse is true for rings with unity. 


1.2 Theorem. [fa ring R has unity, then every ideal I in the matrix 
ring R,, is of the form A,,, where A is some ideal in R. 


Proof. Let (e,)), i,j = 1,2,....2, denote the matrix units in R,. Set A = 
{a,, © R\|Za_e,;€ 1). Then we claim A is an ideal in R. For let a,,,5,, € 
A, Then there exist matrices a = 2a,,e,,and 8 = 2b,,e,;,1n I. Because J is 
an ideal, a— BE I; so a,, — 6,,€ A. Next, let re R and a,, € A with 
2a,,e, € I. Then the matnx (2a;,e;,)(re,,) = Za,re, € 1, and, hence, 
a,,r € A. Similarly, ra,, € A. So A 1s an ideal in R. We now proceed to 
show that J = A,. Let x = 2a,,e,, € J and let rand s be some fixed integers 
between | and n. Our purpose is to show that we can multiply x suitably 
on the right and left to produce an element in / whose entry at the (1,1) 
position is a,,. This will mean that all a,; © A and, hence, x € A,. Clearly, 


a(S ayes)e = ArsO ys 


which shows a,, € A, as desired. Thus, / C A,. On the other hand, let 
x = Za,,e,,€ A,. Here we will show that each term a,,e,; of x is in J. 
Because a,,€ A, there exists a matrix 25,,e,,€ 7 such that 5,, = a,;. 
Then e,(28,,e,,)€,; = 5,,e;; © I; that is, a,,e,,€ I for each t, j = 1,2,...,7. 
Hence, x € J. This completes the proof. O 


As an immediate consequence, we get an important result contained in 


1.3 Corollary. If D is a division ring, then R = D,, has no nontrivial 
ideals. 


Proof. Let I be a nonzero ideal in D,. Then | = A,, where A is some 
nonzero ideal in D. But the only ideals in any division ring D are (0) and D. 
Thus, A = D; so / = D,,, proving that (0) and D, are the only ideals in the 
ring D,. O 


Remark. Ifn > 1, D,, has nontrivial right, as well as left, ideals [see Exam- 
ple 1.1(d)]. Because (0) and D are also the only right (or left) ideals of D, it 
follows that Theorem 1.2 does not hold, in general, if we replace the word 
“ideal” by “‘right ideal” (or “‘left ideal’) in the theorem. 
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Further, the following example shows that Theorem 1.2 is not neces- 
sarily true if R is a ring without unity. 


Example. Let (R,+) be an additive group of order p, where p is prime. 
Define multiplication by ab = 0 for all a,b € R. Then R is a ring without 
unity. If X isan additive subgroup of (R,+), then_X is also an ideal, because 
for all x EE X and rE R, we have xr=O€E X and rx=O€E YX. It then 
follows that a nonempty subset X of R is an ideal of R iff (X,+) is a 
subgroup of (R,+). But since (R,+) has no proper subgroups, (0) and R are 
the only ideals of R. Now, it is straightforward to verify that 


={(5 8) 


is an ideal in R,. Suppose / = A, for some ideal A in R. This implies 
I = (0) or R2, since (0) or R are the only ideals in R. But clearly J # 0, and 
Ig R,. Hence, Theorem 1.2 does not hold, in general, for matrix rings 
over rings without unity. 


a,be r} 


1.4 Theorem. Let (A,)je, be a family of right (left) ideals in a ring R. 
Then 1) <4 A; is also a right (left) ideal. 


Proof. Let a,b EN ie,4;, rE R. Then for all i€ A, a— bE A,, and ar 
(ra) € A,, because A, are right (left) ideals. Thus a — 6 and ar (ra) € 
,<,A4;, proving that ,_,A, is a right (left) ideal. O 


Next, let S be a subset of a ring R. Let of = (AA 1s a right ideal of R 
containing S}. Then. # © because R € of. Let! =1,-,A. Then /is the 
smallest right ideal of R containing S and is denoted by (S),. The smallest 
right ideal of R containing a subset S is called a right ideal generated by S. 
If S = (a,,...,@,,) 1S a finite set, then (S), is also written (a,,...,a,,),. Simi- 
larly, we define the /eft ideal and the ideal generated by a subset S, 
denoted, respectively, by (S), and (S). 


Definition. A right ideal I of a ring R is called finitely generated if J = 
(a,,....4,,),f/or some a,E R, | sism. 


Definition. A right ideal I of a ring R is called principal if I = (a), for 
some ae R. 


In a similar manner we define a finitely generated left ideal, a finitely 
generated ideal, a principal left ideal, and a principal ideal. 
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We leave it as an exercise to verify that 


(a) = py ras, + ra + as + nalr,s,r,,5,E R,nEZ >, 
finite sum 


(a), = (ar+ nalrE Rin € Z}, 
(a) = {ra + najrE R, n€& Z)}. 


If 1 € R, these can be simplified to 


(a) = >> ras,\r;,5; © rt, 


finite sum 
(a), = {ar|r€ R), 
(a) = (ra|r © R). 


In this case the symbols RaR, aR, and Ra are also used for (a), (a),, and 
(a), respectively. 


Definition. A ring in which each ideal is principal is called a principal ideal 
ring (PIR). 


Similarly, we define principal right (left) ideal rings. A commutative 
integral domain with unity that is a principal ideal ring is called a principal 
ideal domain (PID). 

We note that all ideals in the ring of integers Z and the polynomial ring 
F [x] over a field F are principal. Let J be a nonzero ideal in Z, and let 1 be 
the smallest positive integer in / (note that / does contain positive in- 
tegers). Then for any element m € J, we wnite m = qn +r, whereO Sr< 
n. Then r= m— qn GI. Because n is the smallest positive integer in /, r 
must be zero. Hence, »m = gn, so / = (n). Similarly, by using the division 
algorithm for F[x] (Theorem 4.1, Chapter 11), we can show that every 
ideal in F [x] is principal. Thus, Z and F[x] are principal ideal rings. 

We now proceed to define the quotient ring ofa ring R. Let / be an ideal 
in aring R. For a,b € R define a = b (mod /) ifa— 6 € IJ. Then = is an 
equivalence relation in R. Let R// denote the set of equivalence classes, 
and let a € R/I be the equivalence class containing a. It can be easily 
checked that the class a consists of elements of the form a + x, x € J, and 
thus, it is also denoted by a + /. 

Wecan make R//intoaring by defining addition and multiplication as 
follows: 


atb=a+b, ab=ab. 
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We can easily show that these binary operations are well defined. For 
example, we proceed to show that multiplication is well defined. There- 
fore, let a= Cand b=d. Thena-—c€/and b—dé I. Write 

ab — cd=a(b— d) + (a— od. 


Because a — c, b — d€ I, which is an ideal, we have ab — cd € I. Hence, 
ab = cd. This proves that multiplication i is well defined. 
We show (R/I/,+, -) is a ring. First, (R//,+) is an additive abelian group: 


at+(b+c)=at+(b+o=(at(b+o)=(atbt+c 


mat+b+c=(at+b)+. (1) 

a+0=at+0=a. _ (2) 

a+(-a a) =a+(—a) a)=0. (3) 

a+b=at+b=b+a=b+4. (4) 
Second, (R// :) is a semigroup: 

a(b Cc) = a(bc) = a(bc) = (ab)c = (ab) ¢ = (ab)é. (5) 


Third, the distributive laws hold: 
a(b + ¢) = a(b + c) = a(b + c) = ab + ac 
_ =ab+ac=ab+ac. (6) 
(6+c)a=ba+ca, similarly. (7) 
Further, if 1 € R, then | is the unity of R/J; and if R is commutative, 
then R// is also commutative. 


Definition. Let I be an ideal ina ring R. Then the ring (R/I,+,°) is called 
the quotient ring of R modulo I. 


The ring R/I/ is also sometimes denoted by R if there is no ambiguity 


about /. 
If J = R, then R// is the zero ring. If J = (0), then R/J is, abstractly 
speaking, the same as the ring R by identifying a + (0), aE R, with a. 


1.5 Examples 
(a) Let (n) = (naja € Z) be an ideal in Z. If n * 0, then the quotient ring 
Z/(n) is Z,, the usual ring of integers modulo x. If n = 0, then Z/() is 
the same as Z. 

(b) Let R be a ring with unity, and let R[x] be the polynomial ring over 
R. Let I = (x) be the ideal in R[x] consisting of the multiples of x. Then 
the quotient ring 


R{x]/I = (ala © R). 
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Solution. Let x € I. Then x = 0. Therefore, if 

a+ bx +cx?--> © Rix}/I, 
then 

at bx+cx?+- =at+bxt+cx?+--=4, 


showing that R[x]// = (aja € R). 
(c) Consider the quotient ring R[x]/(x? + 1). Then 


This yields x3 = — x, x* = 1, and so on; so in general, x" = + 1 ifniseven 
and x” = +x ifn is odd. Let 
at bx + cx? +-** GE R[x]/(x? + 1). 
Then 
+ cx? +... 


atbx+cx?te. = 
= for some a, BE R. 


a+ bx 
a+ Bx 
Thus, 

R[x}/(x? + 1) = (a+ Bx|labBER) where x?=—-1. 


Indeed, R[x]/(x? + 1) is the field of complex numbers, where a, xR, is 
identified with a, and x is identified with /—1. 
(d) Let R =(% 9), and let A =(° 2) be an ideal in R. Then 
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R/A is, indeed, the ring of integers, where (2 9) is identified with n. 

(e) Find the nontrivial (i) right ideals and (ii) ideals of the ring R = (7 9). 
Solution. (i) Let A be a nonzero right ideal of R. Let 


(” 2) E A for some a & a} 


Clearly, X is an additive subgroup of Z; hence, X = n)Z for some no € Z. 
Case 1. X # (0); that is, m) # 0. 
We show 


= NL Q0 
a=(" 2), 
Let a € Q be such that 
Ny a 
(‘e Gea. 
Let z€ Zand gE Q. Then 
NoZ G@\_[% a z q/n 
& ‘) (‘ Na nea. 
Hence, 
NL Q 
("zou 
Trivially, 
NL Q 
1c("2 9) 
Therefore, 


(2 9) 


as claimed. The proof also shows that A is a principal right ideal of R. 


Case 2. X = (0). 
Let 
= 0 q 
K {aca ¢ 2) <a}. 
Clearly, K is an additive subgroup of Q, and A = (8 *). Conversely, if K is 
an additive subgroup of Q, it is easy to verify that (§ *) is indeed a right 


ideal of R. Therefore, in this case any right ideal of R is of the form (8 *), 
where K is an additive subgroup of Q. 


x={nez 
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(11) It is straightforward to verify that the nontrivial right ideals 


-{%Z Q 
A ( 0 a), 0# nm € Z, (1) 
and 
0 K = 
A= t 4 ; K an additive subgroup of Q, (2) 


of Rare also left ideals. Therefore, we have an example of a ring that is not 
commutative in which each right ideal is an ideal. It is interesting to note 
that each left ideal of R is not an ideal. (Consider 


("3") 


where ny and a are fixed elements in Z and Q, respectively.) 


mezh, 


Problems 


1. Let R be a commutative ring with unity. Suppose R has no 
nontrivial ideals. Prove that R is a field. 

2. Prove the converse of Problem 1. 

3. Generalize Problem | to noncommutative rings with unity hav- 
ing no nontrivial right ideals. 

4. Find all ideals in Z and also in Z/(10). 

5. Find all ideals in a polynomial ring F[x] over a field F. 

6. Find right ideals, left ideals, and ideals of a ring R = [9 9). 

7. Show that every nonzero ideal in the ring of formal power series 
F[[x]] in an indeterminate x over a field F is of the form (x”) for 
some nonnegative integer m. 

8. Let L(R) be the set of all right (left) ideals in a ring R. Show that 
L(R) is a modular lattice (see Problem 17, Section 1, Chapter 5). 
Give an example to show that the lattice L(R) need not be dis- 
tributive. 


[A lattice is called distributive if a v (b A c)=(av b) A (a v c) for all 
a,b,ceL.] 


2 Homomorphisms 


Let R be a ring, and let / be an ideal in R. Let R// be the quotient ring of R 
modulo /. Then there exists a natural mapping 


yn: R—-R/I 
that sends a € Rintoa € R/I. This mapping preserves binary operations 
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in the sense that - 
n(a + b) = n(a) + n(b), 

n(ab) = n(a)n(d), 


A mapping with these two properties is called a homomorphism of R 
into R/J. The mapping n is called a natural or canonical homomorphism. 


abe R. 


Definition. Let f be a mapping from a ring R into a ring S such that 


(i) flat b)=f(a)+ f(b), abeER, 
(iit) f(ab)=f(a)f(b), =a, DER. 


Then f is called ahomomorphism of R into S. 


If f is 1-1, then fis called an isomorphism (or a monomorphism) of R 
into S. In this case fis also called an embedding of the ring R into the ring 
S(or R is embeddable in S); we also say that Scontains a copy of R, and R 
may be identified with a subring of S. The symbol R G, S means that R is 
embeddable in S. 

If ahomomorphism /from a ring R into a ring S is both 1-1 and onto, 
then there exists a homomorphism g from S into R that is also 1-1 and 
onto. In this case we say that the two rings R and S are isomorphic, and, 
abstractly speaking, these rings can be regarded as the same. We write 
R = S whenever there is a 1-1 homomorphism of R onto S. As stated 
above R = S implies S = R. Also, the identity mapping gives R = R for 
any ring R. It is easy to verify that iff; R — S and g: S — T are isomor- 
phisms of R onto Sand S onto 7, respectively, then gfis also an isomor- 
phism of R onto 7. Hence, R = Sand S = Timply R = T. Thus, we have 
shown that 


Isomorphism is an equivalence relation in the class of rings. 


Let us now list a few elementary but fundamental properties of homo- 
morphisms. 


2.1 Theorem. Let f: R — S bea homomorphism ofa ring R intoa ring 
S. Then we have the following: 


(i) If0 is the zero of R, then f(0) is the zero of S. 
(ii) Ifac R, then f(—a) = — f(a). 
(iti) The set {f(a)\a € R) is a subring of S called the homomorphic 
image of R by the mapping f and is denoted by 1m for f(R). 
(iv) The set{a € R\f(a) = 0) is an ideal in R called the kernel of fand 
is denoted by Ker for f~- (0). 
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(v) If1 ER, then f(1) is the unity of the subring f(R). 
(vi) IfR is commutative, then f(R) is commutative. 


Proof. (i) Let a € R. Then f(a) = f(a + 0) = f(a) + f(0), and thus /(0) is 
the zero of S. For convenience, {(0) will also be written as 0. 
(ii) f(0) = f(a + (—a)) = f(a) + f(—a); therefore, f(— a) = —f(a). 
(iii) Let f(a), f(b) € f(R). Then 
f(a) — f(b) = f(a) + f(— b) = fla — b) € f(R). 
Also, 


f(a) f(b) = f(ab) € f(R). 
Hence, f(R) is a subring of S. 
(iv) Let a,b € f- (0) = Kerf. re R. Then 
f(a — b) = f(a) — f(b) =0 -0=0. 
Therefore, a — b € f— (0). Also 
Sar) = f(a) f(r) = Of(r) = 0. 


Thus, ar € f—'(0). Similarly, ra € f—'(0). Hence, f~'(0) is an ideal in R. 
(v) If aE R, then f(a) f(1) = f(al) = f(a). Similarly, f(1) f(a) = f(a). 
Hence, f(1) is the unity of /(R). 
(vi) If a,b € R, then 


f(a) f(b) = f(ab) = f(ba) = f(b) f(a). 
Thus, f(R) is commutative. O 


It is clear that if fis 1-1, then Ker f= (0). 

Conversely, if the kernel of ahomomorphism /: R — Sis (0), then / is 
necessarily 1-1. For if f(a) = f(b), it follows that f(a — b) = 0; thus, 
a — bE Ker f= (0). This shows a = 5, and therefore fis 1-1. We record 
this fact separately in 


2.2 Theorem. Let fR — S bea homomorphism ofaring R into aring 
S. Then Ker f= (0) ifand only if fis 1-1. 


Let N be an ideal in a ring R. Form the quotient ring R/N of R modulo 
N. We can define a mapping 
yn. R— R/N 


in an obvious way (called the canonical or natural homomorphism) by the 
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rule 
n(a)=a+N=a. 
Then 7 is a homomorphism of R onto R/N because 


n(a+ b)=a+b=a+ b=n(a) + (bd), 
n(ab) = ab = ab = n(a)n(6). 


Also, if x € R/N, there exists x € R such that n(x) = x. Thus, 7 is a 
homomorphism of R onto R/N; therefore R/N is a homomorphic image 
of R. It is an important fact that, in a certain sense, every homomorphic 
image of a ring R is of this type — a quotient ring of R modulo some ideal 
of R. 

This is stated more precisely in the following 


2.3 Theorem (fundamental theorem of homomorphisms). Let f be a 
homomorphism of a ring R into a ring § with kernel N. Then 


R/N=Imf. 


Proof. Let us define g(a + N) = f(a). Then g is a mapping of R/N into 
Im f, fora+ N=65+ N implies a— FEN, or f(a — 6) = O. This gives 
(a) — f(b) = 0, so gis well defined. Next, gis ahomomorphism. Writing 
a for a + N, we have 


g(a + b) = g(a + b) = f(a + b) = f(a) + f(b) = g(a) + 2(b). 


Similarly, g(a) = g(@)g(b). Clearly, gis an onto mapping. We show g Is 
l-1. Let f(a) = f(b). Then f(a — 6) = 0, soa— DEN. But then a= b. 
This shows g is 1-1. Hence, R/N=Imf. O 


As remarked earlier, we regard isomorphic rings (abstractly) as the 
same algebraic systems. In this sense this theorem states that the only 
homomorphic images of a ring R are the quotient rings of R. Accordingly, 
if we know all the ideals NV in R, we know all the homomorphic images 
of R. 

The fundamental theorem of homomorphisms between rings is also 
stated as follows. 


2.4 Theorem (fundamental theorem of homomorphisms). Given a 
homomorphism of rings f: R — S, there exists a unique injective homo- 
morphism g: R/Ker f— S such that the diagram 
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R/Ker f 


commutes; that is, f= gn, where n is the canonical homomorphism. 


Clearly, g defined as g(a + N) = f(a) is injective. Also, f= gn, as 
proved in Theorem 2.3. To observe that g is unique, let f= hn, where 
h:R/Ker f— Sis a homomorphism. Then gn(a) = hn(a) for all a € R; so 
g(at+ N)=h(a+N). This provesg=h. O 


Next, let f be a mapping ofa ring R into a ring S, and let A be a subset of 
S. We denote by f~ '(A) the set ofall elements of R whose images under the 
mapping fare in A; that is, 


J~MA) = (r& Rif(r) € A}. 


We remark that f—! is a mapping of subsets of S into subsets of R. 

If, however, fis a 1-1 mapping of R onto S, then the symbol /—' is used 
to denote the mapping of S onto R defined by /~'(s) = r, where r is the 
unique element of R such that f(r) = s. The context will make it clear in 
which sense the symbol f—! is used. 


2.5 Theorem (correspondence theorem). Let f; R— S be a homo- 
morphism ofa ring R ontoaring S, and let N = Kerf. Then the mapping F: 
A+» {(A) defines a 1-1 correspondence from the set of all ideals (right 
ideals, left ideals) in R that contain N onto the set of all ideals (right ideals, 
left ideals) in S. It preserves ordering in the sense that A & Biff f(A) ¢ f(B). 


Proof. Let X be an arbitrary ideal in S, and set A = f—'(X). Then A is an 
ideal in R. For if a,b € A andr € R, then f(a — b) = f(a) — f(b) € X, and 
S(ar) = f(a) f(r) € X, since f(a) €& X and X is an ideal. Thus, a— bE 
S-\(X) and ar € f—'(X). Similarly, ra € f-'(X). Hence, A = f—'(X) is an 
ideal in R. For any ideal A in R it is easy to check that f(A) is an ideal in S. 

To show that the mapping F 1s onto, we show that any ideal X 1n Sis of 
the form f(A) for some ideal A in R. In fact, we show X = f(f-'(X)). 
Trivially, X D> f(f—'(X)). So let x € X. Because fis onto, there exists an 
a€&R such that f(a) = x. Then a € f—'(X); so x = f(a) E f(f— (X)) as 
desired. 
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We now show that the mapping F is 1-1. Suppose that A and B are 
ideals in R containing N such that {(A)=/(B). We claim that 
f—*(f(A)) = A. Here it is trivial that A C f-—'(f(A)). So let x € f-'(f(A)). 
Then f(x) € f(A). This implies f(x) = f(a) for some a€ 4A; that is, 
x —a€KerfCA. Hence, x € A. Similarly, f—'(/(B)) = B. Therefore, 
f(A) = f(B) implies A = B. 

Finally, let .A and B be ideals in R such that A ¢ B. Then f(A) C f(B). If 
S(A) = f(B), we get, as before, A = B. Thus, f(A) & f(B). Conversely, let 
f(A) G f(B). Then A = f~'( f(A) C f~'(/(B)) = B. But since f(A) ¢ f(B), 
we cannot have A = B. Hence, A ¢ B. The 1-1 correspondence between 
the set of right or left ideals can be established exactly in thesame way. © 


An important consequence of Theorem 2.5 is the following immediate 
result, which we state as a theorem in view of its importance throughout 
our study. 


2.6 Theorem. // K is an ideal in a ring R, then each ideal (right or left 
ideal) in R/K is of the form A/K, where A is an ideal (right or left ideal) in R 
containing K. 


Proof. Let 7: R— R/K be the natural homomorphism of R onto R/K. 
Then any ideal (right or left ideal) in R/K is of the form 7(A), where A is an 
ideal (right or left ideal) in R containing Ker 7 = K. Then Kis also an ideal 
in A (regarded as a ring by itself), and (A) = {n(a) = a+ Kja € A} is 
precisely A/K. This proves the theorem. O 


Theorem 2.6, in particular, shows that if an ideal M7 in a ring R is such 
that M/ # R, and no ideal of R (other than R itself) contains M properly, 
then R/M has no nontrivial ideals. Such ideals (called “‘maximal’’) exist, 
in general, in abundance and play a very important role in the theory of 
rings and fields. The examples are ideals (p) in Z, where p is prime, and 
ideal (x? + 1) in R[x]. We return to this discussion in Section 4. 

A concept opposite to that of homomorphism is that of antthomomor- 
phism, defined next. 


Definition. Let R and S be rings. A mapping f: R — S is anantihomomor- 
phism if, for all x.y € R, 
S(xt+y=fxy+f(y) and f(xy) = f(y) > SQ). 
Thus, an antihomomorphism preserves addition but reverses multipli- 
cation. An antihomomorphism that is both 1-1 and onto is called an 
anti-isomorphism. 
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Let R = (R,+,:) be a ring. Define binary operation « in R as follows: 
Xoyp=y x for all x,yE R. 
It is easy to verify that the system (R,+,°) is a ring. 


Definition. Let (R,+,-) be a ring. Then the opposite ring of R, written R, 
is defined to be the ring (R,+,°), wherexe y= y: x, forall x,yE R. 


2.7 Examples 


(a) The only homomorphisms from the ring of integers Z to Z are the 
identity and zero mappings. 


Solution. Let f be a nonzero homomorphism. Then (/(1)? = 
SWIC) =f > 1) = fC) #0. So f(1) is a nonzero idempotent in 
S(Z) C Z. But the only nonzero idempotent in Z is |. Therefore f(1) = 1; 
hence, 


n times 
O—— 
f()=fQl+14+--4D=f(l) +--+ f(1) (if n > 0) 
—n times 
SS 
=f(-—1-1--:--N=f(-1)+--+f(-1) (ifn <0) 
= () (if m = 0). 


In each case f(n) = n, which completes the solution. 


(b) Let A and B be ideals in R such that B C A. Then 
R/A = (R/B)/(A/B). 
Solution. Define a mapping f: R/B — R/A by setting f(x + B) = x + A. 
f is well defined, for x, + B= x, +B implies x, —x,€ BCA. Thus, 
Xx, +A= X2 + A. ™ 
Now Ker f= (x + B|x + A = 0} = (x + Bix € A} = A/B. Hence, by 
the fundamental theorem of homomorphisms 


(R/B)/(A/B) = R/A. 
(c) Any ring R can be embedded in a ring S with unity. 


Solution. Let S'be the cartesian product of R and the set of integers Z; that 
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is S= RX Z. S is made into a ring by defining 


(a,m) + (b,n) = (a + bm + n), 
(a,m\(b,n) = (ab + na + mb,mn), 


where a,b € R and m,n € Z. It can be easily checked that S is a ring with 
binary operations defined above, and (0,1) is the unity of S. Also the 
mapping f: R— S defined by f(a) = (a,0) is a 1-1 homomorphism. 
Hence, fis an embedding of R into S. 


(d) Let R be a ring. Then (R,)° = (R™),. 


Solution. Define a mapping /: (R,,) — (R), by /(A) = "A, the transpose 
of A. Recall that, as sets (and indeed as additive groups), R = R® and 
R, =(R,)®. Then by the definition of the transpose of a matrix, 
{A + B) ='4 + 'B; so f(A + B) = f(A) + f(B). We now show f(A ¢ B) = 
S(A)f(B), where the multiplication of matrices f(A) and f(B) is in the ring 
(R™),. Assume A =(a,), B= (b,), f(A) ='4 = (ai), and f(B) ='B = 
(b;,). Then aj,= a, and bj,= 6, for all | si,jsn. Now f(A B)= 
(BA) = (BA). The (i, j) entry of {BA) is the (j,i) entry of BA, which is 
given by 


>> bi Aes ne 2 Qyi° bi, a >> Qin? by 
(i,j) entry of ‘A'B E (R®),. 


Hence, (BA) = ‘A'B. This proves that f(A ° B) = f(A) /(B). fis clearly 1-1 
and onto. Hence, (R,)°? = (R™),. 


Problems 


1. Find the ideals of the ring Z/(7). 

2. Prove that Z[x}]/(x?+1)=Z[i], where Z[i)= 
(a + bV—1|a,b € Z). 

3. Show that there exists a ring homomorphism f: Z/(m) — Z/(n) 
sending | to 1 if and only if njm. 

4. Show that the set N of all nilpotent elements in a commutative 
ring R forms an ideal. Also show that R/N has no nonzero nilpo- 
tent elements. Give an example to show that N need not be an 
ideal if R is not commutative. 

5. Let S be a nonempty subset of R. Let 


r(S) = (x € R|Sx = 0) and i(S) = (x € RixS = 0}. 


14. 


15. 


17. 


19. 
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Then show that r(S) and /(S) are right and left ideals, respectively 
(r(S) and /(S) are called right and left annihilators of S, respec- 
tively]. 

In Problem 5 show that r(S) and /(S) are ideals in R if Sis an ideal 
in R., 

Show that any nonzero homomorphism of a field F into a ring R 
is I-]. 

Let f: F — F bea nonzero homomorphism ofa field F into itself. 
Show that fneed not be onto. 

In any ring R show that the set A = (x € R|r(x) N B ¥ (0) for all 
nonzero right ideals B of R) is an ideal in R. 

Show that the intersection of two ideals is an ideal, but that the 
union need not be ideal. 

If A, B, and C are right (left) ideals in a ring R such that A ¢ BU 
C, show that A C BorACC. 

Show that a ring R with unity is a division ring if and only if R has 
no nontrivial right ideals. 

Let R, and R, beringsand R = R, X R,, thedirect product of R, 
and R,. Show that 


RT = ((x,0)|xER,), R= ((0,yy E Ro) 


are ideals in R. Also show that Rf = R,, R} = R, as rings. 
Let A be a right ideal of R. Define 


I(A) = {xeER|xaeA for all ae A}. 


Prove that /(A) is the largest subring of R in which A ts contained 
as an ideal. (/(A) is called the idealizer of A in R.) 

Show that (R/A), = R,/A,, where A is an ideal in a ring R. 

Let R be a ring. Show that R is anti-isomorphic to R°. 

Letf,g: R — Rbe antihomomorphisms (anti-isomorphisms) of a 
ring R into itself. Show that fg is a homomorphism (isomor- 
phism). 

Let R be a ring with unity. If the set of all noninvertible elements 
forms an ideal M, show that R/M is a division ring. Also show that 
this is so if, for each r € R, either r or | — r is invertible. 

Show that the following rings R satisfy the property of Problem 
18: that is, the set of all noninvertible elements forms a unique 
maximal ideal /. Find the ideal / in each case. 

(a) R=Z/(p’), p prime. 

(b) R= F{[x]]. 

(Such rings are called /ocal, as defined in Chapter 12.) 
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3 Sum and direct sum of ideals 


In this section we discuss the concept of the sum of ideals (right ideals, left 
ideals) in a ring R. 


Definition. Let A, ,A3,....A, bea family of right ideals in a ring R. Then the 
smallest right ideal of R containing each A,, 1 S isn (that is, the inter- 
section of all the right ideals in R containing each A,), is called the sum of 
A, Agy,Ap- 


3.1 Theorem. If A,,A3,....4, are right ideals in a ring R, then 
S = (a, + a, + +++ +a,|a,€ A,;, i= 1,2,...,n} is the sum of right ideals 
Ay ,Ag,..Ap. 


Proof. It is clear that 
S = (a, + a> + soe + a,|a, © A, i= 1,...,70) 


is a right ideal in R. Also, if a,eA,, then a, =a, +0+--- +0 is in S, and, 
hence, A, C S. Similarly, each A,, { = 2,...,n, is contained in S. Further, if 
T is any right ideal in R containing each 4,, then obviously7 > S. Thus, S 
is the intersection of all the right ideals in R containing each 4;. O 


Notation. The sum of right (or left) ideals A,,A,...,4, in a ring R is 
denoted by A, + A, +:--+ A,. From the definition of the sum it is clear 
that the order of A,’s in A, + A, +-:: +A, is immaterial. We write 
A, + A, + sas + A, as 2 fn Aj. 


Definition. A sum A = 27_,A, of right (or left) ideals ina ring R is calleda 
direct sum if each element aE A is uniquely expressible in the form 
Lr. 4,,4,E A, 1 SisnlfthesumA = 2A,isa direct sum, we write itas 


A=A,@A,@ °°: O4,=© A, 
=} 


Note. One can similarly define the sum and direct sum of an infinite 
family of right (left) ideals in a ring R. Although no extra effort is needed 
to talk about this, we prefer to postpone it to Chapter 14, where we 
discuss sum and direct sum of a more general family. 


3.2 Theorem. Let A, ,A3,...,A, be right (or left) ideals ina ring R. Then 
the following are equivalent: 
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(i) A= 2?,A, is a direct sum. 
(ii) If0=T%,4a,, a, €A,, then a,= 0, i= 1,2,...,n. 
(iit) Ay Zfuy, joi Ay = (0), f= 1,2,...,71. 
Proof. (i) => (ii) Follows from definition of direct sum. 
(ii) => (iii) Let x € A, N Thuy, jus Ay. Then 
X™Ma, teeta, taj, to°° +a, € Aj. 
Thus, 
0=a, +--+ 4,_,; + (—x) + aj, too +... 
Then by (ii) we get x = 0. 
(iii) => (i) Let a=a,+a,+°--+a, and a=b,+6,+--+6,, 
where a,,b,€ A,, i= 1,2,...2. Then 0 = (a, — b,) + (a, — 5) +°°° + 
(a, — b,). This gives 


a, — bE A, YA, = (0) 


Hence, a, = 5,. Similarly, a, = b,,...,.a, = b,. Hence, A = 27.,A, is a 
directsum. O 


3.3 Theorem. Let R,,R3,....R, be a family of rings, and let R= R, X 
R,X°::XR,, be their direct product. Let R? = {(0,...,0,a,,0,...,.0)|a, © 
R,). Then R= @"_,R* is a direct sum of ideals R?, and R? = R, as 
rings. On the other hand, ifR = © X17, A,, a direct sum of ideals of R, then 
R= A, X A, X°*:XA,, the direct product of the A,'s considered as rings 


on their own right. 


Proof. Clearly, R* are ideals in R, and R = Rf +-°- + Ry. We prove that 
R is a direct sum of ideals Rj. Let 


x€E RIN 5 R*. 
» j 
j@i 
Then 
x = (0,0,...,@,,0,...,0) = (Gy 25.00, Qj—4 0, p41 50009 Ap): 


This gives a, = 0 and, hence, x = 0. Therefore, 


R= OR 
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For the second part we note that if x © R, then x can be uniquely 
expressed as a, + a, +°**+a,,a,€ A,, 1 Sisn, Define a mapping 


f: OBA Ar X A, XX A, 


by 
f(a, + a2 Beet 2 a,) a (a, 825.045 An). 


Because 27, A, is direct, f is well defined. It is also clear that f is both 1-1 
and onto. Also, if x,y € OT7., A,, then f(x + y) = /(x) + f(y). To show 
S(xy) = f(x) f(y), we need to note that if x = a, + a, +°:: +a, and y= 
b, + b, +++: +5,, with a,,b,E A,, 1 Sisn, then, for i*j, a,b, = 0, 
since a,b,€ A, A,™ (0). This remark then immediately yields that 
S (xy) = f(x) f(y). Hence, fis an isomorphism. 0 


The direct sum R = © E*., A, is also called the (internal) direct sum of 
ideals A, in R, and the direct product A, X A, X-°-: X A, 1s called the 
(external) direct sum of the family of rings A,, i = 1,2,...,. In the latter 
case the notation A, © A, © -:: @ A, is also frequently used. The context 
will make clear the sense in which the term “direct sum” is used. 


Definition. A right (left) ideal 1 in a ring R is called minimal if (i) I * (0), 
and (ii) if J is a nonzero right (left) ideal of R contained in I, then J = I. 


It is clear that if J is a minimal right ideal, then J is generated by any 
nonzero element of /. Indeed, if Jis a nght ideal of R with the property that 
each nonzero element generates /, then / is minimal. To see this, let /be a 
right ideal of R such that 0 # JC J. Suppose 0 # a € J. By assumption, 
I[=(a),CJ,sol= J. 

If R is a division ring, then R itself is a minimal right ideal as well as a 
minimal left ideal. A nontrivial example of a minimal right ideal is 


Gy, °° Ay 
0 «+ QO 

e,R= {| - _ || ayED>, 
O «+ Q 


where R = D, is the n X nm matrix ring over a division ring D. Let J be a 
nonzero right ideal of R contained in e,,R. Let O # 27, a,,e,, © J. Then 
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for some I, 1 <! <n, a;,#0; so a;,' exists. This gives 


en ( aye, enhe J. 


Therefore, e,,R C JC e,,R. Hence, J = e,,R, showing that e,,R is min- 
imal. 


3.4 Examples 
(a) For any two ideals A and B in a ring R, 
A+B A 


OB *AnB 
y ae A+B A+B _ B Y A 
“) anB A B ANB ANB 
In particular, if R = A + B, then 
JR LR R 
ANB A 8B 


More generally, if A, ,A2,...,A, is a family of ideals in a ring R with unity 
such that A, + A, = forall 1 si,jsn,i#/J, then 


R R _R R 

ae KKK, 
be A 
(VA, A, A, R 


i=] 


Solution. (i) It is clear that Bis an ideal in.A + Band A /N Bisan ideal in A. 
Then the mapping f: A — (A + B)/B, given by f(a) = a + B, is clearly a 
homomorphism. Also, if x + B € (A + B)/B and x=a+b,aEA, bE 
B, then f(a) = a+ B= x + B. Thus, / is onto. It is easy to check that Ker 
JS=AQCB. Hence, by the fundamental theorem of homomorphisms, 
A/(A /N B) = (A + B)/B. 

(11) Note that A and B are also ideals in A + B considered as a ring by 
itself. The mapping 


g:A+B—(A+B)/A X(A+B)/B, 


given by g(x) = (x + A,x + B), x E A+B, is clearly a homomorphism 
whose kernel is A M B. We show that g is onto. 

Let (x + A,O + B)E (A + B)/A X (A + B)/B. Writex=a+b,aEA, 
bE B. Then g(b) = (6 + A,b + B) = (x + A,O + B). Thus, each element 
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of the type (x + A,0 + B) has a preimage in A + B by the mapping g. 
Similarly, elements of the type (0 + A,y + B) have preimages. Hence, g is 
an onto mapping. Then by the fundamental theorem of homomorphisms 
we get 


A+B A+B ATB 


ANB A B 
By (i) 
A+B B A+B A 
a “AnB "4 ~B- “ARAB 
proving (ii). 


To prove the last part we first prove by induction that 


R= A+ (4, lsisnn2=2. 
=} 

ji 
Assume that this is true for n — 1 ideals. Then 


a-l n 
R= R?=(A,+A,XA,+ ()A) CA, + 0)A,, 
(A, XA; p} A) d Q) Jj 


jh 


To complete the proof, we assume the result is true for 2 — 1 ideals. Then 
by (it) 


im| i=} 


(b) For any positive integer 7, 
Z/(n) = Z/(p}') X Z/(pZ) X-°> X Z/(pP), 
where n = p*' p?--> pf*, p, are distinct primes. 


Solution. Let A, = (pf), 1s isk. Weshow A, + A, = Z, i * j. Because p, 
and p, are relatively prime, we can find integers a and b such that 
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apf + bp? = 1. Then for all x € Z, 
x = xap;' + xbp‘/E(p;') + (pj’). 


Hence, A, + A, = Z, i ¥ j. 
We further show 


k k 
(n) = ()(27) ()A:. 


Recall that n = p¢' p$ --- pf. Let x € (n). Then x = Apt p? «+ pf, A € Z. 
This implies x € (p#) for all i = 1,2,...,k, and thus x € M4, A,. On the 
other hand, let x € M*,,A,;. Then 

X= pi'a, = pPa,="-=pha, a EZ. 


This shows pf‘|x, 1 s iss k. Because the p,’s are all distinct, we get 2 = 
p%'p? +++ pf* divides x. Hence, x € (n). This proves that (7) = MN, (pf). 
By example (a), the solution is then complete. 


(c) Let (e,), 1 s issn, be a family of idempotents in a ring R such that 
ee, = 0, i #). Then A = e,R@-::@e,R is a direct sum of right ideals 
eR, 1 sisn. (A family of idemopotents (e,), 1 s isn, is called an 
orthogonal family of idempotents if e,e, = 0, i # j.) 


Solution. Let 0 = e,a, ++: + €,a,, €,a, © e,R. Multiplying both sides on 
the left by e,, we get 0 = e,a,. Similarly, e,a,;= 0, 2s isn. Thus, by 
Theorem 3.2, A is a direct sum of nght ideals eR, | Sisn. 


(d) Let R bearing with unity, and let (A,), 1 s is 7, bea family of nght 
ideals of R. Suppose R = A, © -:-@ A,. Then there exists an orthogonal 
family of idempotents (e,), 1 s isn, such that A, = e,R. 

Solution. Because 1 € R, we can write 
l=e,+°:-+e,, e, € A;. 

Then 
e=ee te terzte.,e+°°° + ee. 

This yields 
0=e,e,+°°- + (e?—e) + e4,;e, +79 Fee. 


But then by Theorem 3.2, each term is zero; that is, e,e, = 0, j # i, e7 = e,. 
Further, e, € A,;implies e,R C A,. Next, let x € A;. Then 1 =e, +-:: +, 
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gives x = e,x + ... + e,.x, which gives, as before, x = e,x. Hence, x & e,R, 
proving that A, = e,R. 


(e) Let R be a ring with | such that 2 is a finite direct sum of minimal 
right (left) ideals. Then every right (left) ideal A in R is of the form eR (Re) 
where e is an idempotent. 


Solution. By hypothesis R = M, © ---@ M,, where M, are minimal right 
ideals in R. Assume that A # (0) and A # R. Because AN M,C M,and M, 
are minimal right ideals, either 4 M M, = (0) or AN M, = M,. Because 
A ¥* R, there exists, say, M, (after renumbering if necessary) such that 
M, ¢ A. But then M,N A = (0). So A, = A + M, is a direct sum. If A, * 
R, then there exists some M,, say M,, such that A, N M, = (0) and AC 
A, = A,®@ M,=A@M,©M,. By continuing this process we must 
come to a right ideal, say A,, that contains all 4, and therefore coincides 
with R. Thus, there exists a right ideal B such that R = A © B. By Exam- 
ple (d), A = eR. 


Problems 


1. Show that A + A = 4 for any ideal 4 in a ring R. 

2. Show directly that Z/(12) = Z/(3) X Z/(4). 

3. Let A,,A3,...,4, be ideals in a ring R with unity such that A, + 
A; = R forall i ¥ j. Show that if x, ,x2,...,x, © R, then there exists 
an element x € R such that x = x,(mod A,) forall i. (This is called 
the Chinese remainder theorem.) 

4. If eis any idempotent in R show that R= eR@(l—e)R isa 
direct sum of right ideals. (In case 1 ¢ R, then (1 — e)R denotes 
the set {a — eala € R).) 

5. Let R= R, © R, be the direct sum of the rings R, and R, each 
with unity. Denote by /, the projections of R onto R,, i = 1,2, 
given by /\(@, a2) = a, and f,(a,,a2) = a,. Prove the following: 
(a) Each fis an onto homomorphism. 

(b) If A is an ideal (right ideal) in R, then 


A = f(A) © f(A). 

(c) The ideals (right ideals) in R are precisely those ideals (right 
ideals) that are of the form A, © A,, where A, and A, are 
ideals (right ideals) in R, and R,, respectively. 

6. Let R = D, be the ring of n X n matrices over a division ring D. 

Let (e,,), 1 S i,j sn, be the set of matrix units. Show that 

(a) e,,R (Re,, is a right ideal (left ideal) in R, i = 1,2,...,n. 
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(b) Let A be a nonzero right ideal of R contained in e,,R. Then 
A = e,,R. Similarly, for left ideals, Re,,. 

(c) R= ®z2.¢,R is a direct sum of nght ideals e,,R. Also 
show this for left ideals. 

(d) If A is right (left) ideal of R, then A = eR (Re) for some 
idempotent e € R [see Example 3.4(e)]. 

7. Let Rand S be rings, and let A and B be ideals in R and S, 
respectively. Show that 


4 Maximal and prime ideals 

If A = (p¢') and B = (p) are ideals in Z generated by pf'and p?, respec- 
tively, where p, , p, are distinct primes and e, ,e2 are positive integers, then 
A+ B= Z. Such ideals are called comaximal ideals in Z. 


Definition. Two ideals A, B in any ring R are called comaximal if 
A+B=R. 


Definition. An ideal A in a ring R is called maximal if 


(i) A#*R; and 

(ii) for any ideal BDA, either B=AorB=R. 
Clearly, an ideal A is maximal if and only if the pair X, A, for all ideals 
X £ A, is comaximal. 


4.1 Theorem. For any ring R and any ideal A # R, the following are 
equivalent: 
(i) Ais maximal. 
(ii) The quotient ring R/A has no nontrivial ideals. 
(iii) Forany elementxE R,x€ A, A +(x) =R. 


Proof. (i) = (ii) By Theorem 2.6 the ideals of R/A are of the form B/A, 
where B is an ideal in R containing A. If B/A 1s not zero, then B # A and, 
therefore, B = R. Therefore, the only ideals of R/A are R/A and the zero 
ideal. 

(ii) => (iii) Let x € Rand x ¢ A. Then A + (x) isan ideal of R properly 
containing A. This implies that (A + (x))/A is a nonzero ideal in R/A and, 
hence, A + (x) = R. 
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(iii) => (i) Let B D A. If B * A, choose an element x € B, x ¢ A. Now 
by (111) A + (x) = R. But BD A +(x) =R. Hence, B=R. O 


Definition. A ring R is called a simple ring if the only ideals of R are the 
zero ideal and R itself. 


A field is clearly a simple ring. Indeed, a commutative simple ring with 
unity must be a field (Problem 1, Section 1). An example of a noncom- 
mutative simple ring is F,, the 7 X n matrix ring over a field F, n> 1. 

An immediate but important consequence of Theorem 4.1 is 


4.2 Theorem. /n a nonzero commutative ring with unity, anideal M is 
maximal if and only if R/M is a field. 


Proof. Suppose R is a nonzero commutative ring with unity. Then, for 
any ideal M, R/M is also acommutative ring with unity. If Misa maximal 
ideal, then, by Theorem 4.1, R/M is also a simple ring. LetO*aGER= 
R/M. ThenaRisa nonzero ideal in R. Because R is simple, aR= R.1 This 
implies that there exists be R such that ab = oF This shows that R is a 
field. Conversely, let R = R/Mbea field. Then R has no nontrivial ideals. 
So if K > Mis an ideal in R, the nonzero ideal K/M in R/M must be R/M. 
Hence, K = R, so M is a maximal ideal. O 


4.3 Examples of maximal ideals 


(a) An ideal M in the ring of integers Z is a maximal ideal if and only if 
M = (p), where p is some pnime. 

Recall that each ideal in Z is of the form (7), where n € Z. Now, clearly, 
(n) = (—7n). So we can always assume that any ideal in Z is of the form (7), 
where nis a nonnegative integer. 

Suppose (7) isa maximal ideal. Then Z/(”) isa field. But ifn is compos- 
ite, say n = 1,N2, wheren, > 1,n.> 1, thenn,n, = Oin Z/(n)and0 ¥ n,, 
0 ¥ n,, which is a contradiction because Z/(n) isa field. Hence, nis prime. 
To prove the converse, let a € Z such that p/a. Then there exist integers 
x,y, € Z such that ax + py = 1. This implies ax + py = 1, so ax = 1, 
which proves that Z/( p) is a field. 

(b) If R is the ring of 2 X 2 matrices over a field F of the form [¢ 8], 


a,b € F, then the set 


w= {[o eller] 
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is €@ maximal ideal in R. Clearly, AZ is an ideal in R. Let 


s-{{5 9] 


Then S is a subring of R that is isomorphic to F under the mapping 
[¢ 8] > a. Further, the mapping /: R — S, where 


a b a 0 
f 4 4 E 4 
is an onto homomorphism whose kernel is /. Thus, by the fundamental 


theorem of homomorphisms, R/M = S. Because S is a field, it follows 
that MM is a maximal ideal in R. 


ac Fh 


Definition. Let A and B be right (left) ideals in a ring R. Then the set 


y aybila,€ A,B, € Bt, 
finite sum 

which is easily seen to be a right (left) ideal in R, is called the product of A 
and B (in this order) and written AB. 


Note that if A and B are ideals in R, then their product AB Is an ideal 
in R. 


4.4 Theorem. Let A, B, and C be right (left) ideals in a ring R. Then 


(i) (AB)C = A(BC), 
(ii) A(B+C)=AB + AC and (B+ C)A = BA + CA. 


Proof. (i) Follows from the associativity of multiplication in R. 
(ii) Clearly AB, AC C A(B + C). Also, if ae A, bE B, cE C, then 
a(b + c) = ab + ac€ AB + AC. Hence, AB+ AC= A(B+C). O 


If A,,...,4, are right (left) ideals in a ring R, then their product is defined 
inductively. Because associativity [Theorem 4.4(i)] holds, the product of 
A,,...,4, may be written as A, A, °:: A, (without any parentheses). It may 
be verified that 


Ay Ay= | Y a alas As} 


finitesum 
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A prime integer p has the fundamental property that if pjab, where a 
and bare integers, then pia or p|b. In terms of the ideals in Z, this property 
may be restated as follows: If ab € (p), then a € (p) or b E (p). Equiva- 
lently, if (ab) C (p), then (a) C (p) or (5) C (p). This suggests the follow- 
ing 


Definition. An ideal P in a ring R is called a prime ideal if it has the 
following property: IfA and B are ideals in R such that AB ¢ P, thenA C P 
or BCP. 


Clearly, in any integral domain (0) is a prime ideal. In fact, acommu- 
tative ring R is an integral domain if and only if (0) is a prime ideal. For 
each prime integer p, the ideal (p) in Z is a prime ideal. 


4.5 Theorem. Jf R is a ring with unity, then each maximal ideal is 
prime. But the converse, in general, is not true. 


Proof. Let M be a maximal ideal, and let A and B be ideals in R such that 
ABC M. Suppose A ¢ M. Then A+M=R. Write | =a+m, aE A, 
meE M. Let b€ B. Then b= ab+ mbE€ AB+ MC M. Hence, BC M. 
This proves M is prime. 

The ideal (0) in the ring of integers is prime but not maximal. O 


For commutative rings we have 


4.6 Theorem. J{R is acommutative ring, then an ideal Pin R is prime 
ifand only ifabEe P,aE R, bE Rimpliesace PorbeE P. 


Proof. Let ab € Pand Pbea prime ideal. We know (a)(b) consists of finite 
sums of products of elements of the type na + ar and mb + bs, where 
nym € Zand r,s € R. Now 


(na + arX{mb + bs) = nmab + nabs + mabr + abrs. 


Because ab € P and Pisan ideal in R, we get (na + ar(mb + bs) or finite 
sums of such like products are in P. Hence, each element of (a)(b) is in P. 
Thus, (aX) C P. But since P is prime, (a) C P or (6) C P. This implies 
a€& P or bE P. Conversely, if abE P, aE R, bE R implies a€ P or 
b € P, then we show P is prime. Let A and B be ideals in R such that 
AB C P. Suppose A ¢ P. Choose an element a € A such that a ¢ P. Then 
AB C Pimplies aB C P, so ab € P for all b € B. But then by our hypoth- 
esisb€ P.Hence. BCP. O 
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4.7 Theorem. Let R be a commutative principal ideal domain with 
identity. Then any nonzero ideal P * R is prime if and only if it is max- 
imal. 


Proof. Let P = aR bea nonzero prime ideal in R. If Pis not maximal, then 
there exists an ideal M such that P¢ M and M # R. Now, M = bR for 
some b € R. Thus, aR ¢ bRimplies a = bx, x € R, and b ¢ aR. Because 
P= aR is a prime ideal and b ¢ aR, we have x € aR. Then x = ay for 
some y€ R. Then 


a= bx = bay 


implies | = by, because R is a domain. Hence, M = R, a contradiction. 
Therefore P is maximal. The converse follows from Theorem 4.5. © 


48 Examples 


(a) Let R be a commutative ring with unity in which each ideal is prime. 
Then R is a field. 


Solution. If ab = 0, then (a)(b) = (0). But since (0) is a prime ideal, (a) = 
(0) or (6) = 0. Hence, ab = 0 implies a=0 or b= 0; that is, R is an 
integral domain. Next, if a € R, then (a\a) = (a”) because R is commu- 
tative. But (a) is prime. Thus, (a) C (a?). Because (a?) is trivially con- 
tained in (a), we get (a) = (a?). This implies a € (a?), and, hence, there 
exists an element x € R such that a = a?x or a(1 — ax) = 0. Ifa # 0, we 
get 1 = ax, that is, a is invertible. Hence, R is a field. 


(b) Let R be a Boolean ring. Then each prime ideal P # R is maximal. 


Solution. Consider R/P. Because P is prime, ab € P (equivalently, ab = 
0) implies aE P or bE P (i.e.. a=0 or 6=0). So R/P is an integral 
domain. Now for all x € R, (x + PXx + P) = x?2+ P=x+ P.SoR/Pis 
also a Boolean ring in addition to being an integral domain. But we 
know that an integral domain has no idempotents except 0 and possibly 
unity. Thus, R/P =(0) or {0, T}. However, R/P =(0) implies R = P, 
which is not true. So R/P = {0, 1}, which obviously forms a field. Hence, 
P is a maximal ideal. 


(c) Let abea nonnilpotent element in a ring R, and let S = (a, a?, a?,...). 
Suppose P is maximal in the family F of all ideals in R that are disjoint 
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from S. Then Pisa prime ideal. (Note that the statement does not say that 
P is maximal in R. Precisely, it means that there does not exist any ideal 
X € F such that X > P.) 


Solution. Let AB C P, where A and B are ideals in R. If possible, let. 4 ¢ P 
and B¢ P. Then A+P3P and B+P2P. By maximality of P, 
(A + P)NS¥* Oand(B + P)N S # G. Thus, there exist positive integers 
i and j such that a! € A + Pand a/ & B+ P. Then 


alas E (A+ PKB + P) = AB + AP + PB + PPC P, 


because AB C P and because P is an ideal in R. Thus, PN S#0Q, a 
contradiction. Hence, AB GC P implies either A C P or BC P, which 
proves that P is a prime ideal. 


(d) Let R = C[0,1] be the ring of all real-valued continuous functions 
on the closed unit interval. If 14 is a maximal ideal of R, then there exists a 
real number 7,0 s rs 1, such that M = M, = (fE RI/(r) = 0), and con- 
versely. 


Solution. Let M be a maximal ideal of C[0,1]}. We claim that there exists 
r © [0,1} such that, for all fE M, f(r) = 0. Otherwise for each x € [0,1] 
there exists {, € M such that /,(x) * 0. Because /, is continuous, there 
exists an open interval, say /,, such that /,(y) * 0 for all y € /,. 

Clearly, [0,1] = Uyejo1)/,- By the Heine-Borel theorem in analysis 
there exists a finite subfamily, say, /,,,/,,,..../,,, of this family of open 
intervals J,, x € (0,1), such that (0,.1J=2,,UZ,U°:-UZ,,. 

Consider f= 27, f2,. Suppose that f(z) = 0 for some z in [0,1]. Now 
(0,1) = Uz, J,, implies that there exists /,, such that z € /,, (1 sk sn). 
Then /,,(z) * 0. But 


f(z) = 0 = D(L(2P = 0 = f(z) = 0, 


a contradiction. Thus, f(z) * 0 for any z € [0,1], which in turn yields that 
fis invertible and a fortiori M = C[0,1}, which is not true. 

Conversely, we show that M, is a maximal ideal of C[0,1]} for any 
r & [0,1]. Itis easy to check that M/, is an ideal. To see that it is maximal, we 
note that C[0,1]/M, is a field isomorphic to R. Alternatively, we may 
proceed as follows. 

Let J be an ideal of C[0,1] properly containing M,. Let g € J, 2g € M,. 
Then g(r) ¥ 0. Let g(r) = a. Then h = g — ais such that h(r) = 0; that is, 
hE M,,soa™=g—heEJ. But a # 0 implies that a is invertible. Conse- 
quently, J = R, which proves the converse. 
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Problems 


1. Let F bea field. Prove that (0) is a prime ideal in F,,. 

Prove that the ideal (x° + x + 1) in the polynomial ring Z/(2)[x] 
over Z/(2) is a prime ideal. 

3. Prove that the ideal (x* + 4) is not a prime ideal in the polyno- 
mial ring Q[x] over the field of rational numbers. 

4. Show that the ideal (x° — x — 1) in the polynomial ring Z/(3)[x] 
over the field Z/(3) is a maximal ideal and therefore, prime. 

5. Show that the following are equivalent for any ring R. 

(a) Pisa prime ideal in R. 

(b) For all a,b € R, aRbC Pimplies a € Por bE P. 

(c) Forallrightideals A, Bin R, AB C PimpliesA C PorBC P. 
(d) Forall left ideals A, Bin R, AB C Pimplies A C Por BC P. 
Let P be a prime ideal in R. Show that (0) is a prime ideal in R/P. 
(A ring S is called a prime ring if (0) is a prime ideal in S.) 

6. Let Pbea prime ideal in a ring R such that the quotient ring R/P 
has no nonzero nilpotent elements. Show that R/P is an integral 
domain. 

7. Let (P,)ie, be the family of all prime ideals in R and let A = 
Nie, P,. Show that A is a nil ideal. [Hint: Use Example (c) and 
assume that a maximal member in the family F exists. The exis- 
tence of a maximal member in F follows by Zorn’s lemma, given 


in Section 6.] 


§ Nilpotent and nil ideals 


Let A,,A3,...,A, be right (or left) ideals in a nng R. If A, = 
A, =:+::= A, ™ A (say), then their product is written as A”. It is possible 
that a right (or left) ideal A is not zero, but A” = (0) for some positive 
integer > 1. Forexample, in the ring R = Z/(4) the ideal A = (0,2) is not 
zero, but A? = (0). As another example, consider the ring R = (% 2) of 
2 X 2 (upper) triangular matrices. It has a nonzero ideal A = (8 2) such 
that A? = (0). Such right (or left) ideals are called nilpotent. 


Definition. A right (or left) ideal A in a ring R is called nilpotent if A” = (0) 
for some positive integer n. 


Clearly, every zero ideal is a nilpotent ideal, and every element in a 
nilpotent ideal is a nilpotent element. However, the set of nilpotent ele- 
ments in a ring R does not necessarily form a nilpotent ideal. (Indeed this 
set may not even be an ideal.) 

Aring R may have nonzero nilpotent elements, but it may not possess a 
nonzero nilpotent ideal, as the following example shows. 
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§.1 Example. Let R = F, be the ring of 2 X n matrices over a field F. 
Then R has nonzero nilpotent elements, such as e,,, i * j, 1 Ss i,j sn. Let 
I be a nilpotent right ideal in R with /* = 0, where k is some positive 
integer. Then consider the ideal 
k times k — 1 times k times 

— ————= Fann 

(RIX RI) -°: (RI) = RUR):** (IR)I G RIT ++: 1 = RIX = (0). 
Hence, R/ is a nilpotent ideal in R. But we know that the ring R = F, has 
no nontrivial ideals (Corollary 1.3). Hence, RJ = (0) or RJ = R. But RI 
cannot be equal to R, since R has unity ¥ 0. Therefore, R/ = (0). But then 
for any aE I,a= lae RI = (0). Hence, J = (0). 


Definition. A right (or left) ideal A in a ring R is called a nil ideal if each 
element of A is nilpotent. 


Clearly, every nilpotent right (or left) ideal is nil. However, the con- 
verse is not true, as the following example shows. 


5,2 Example. Let R = © 2Z/(p'), i = 1,2,..., be the direct sum of the 
rings Z/(p‘), p is prime. Then & contains nonzero nilpotent elements, 
such as (0 + (p),p + (p2),0 + (p°),...). Let J be the set of all nilpotent 
elements. Then / is an ideal in R because R is commutative. So / is a nil 
ideal. But / is not nilpotent, for if /* = 0 for some positive integer k > 1, 
then the element 


x= (0 + (p), 0 + (p),...,0 + (p*),p + (p**'),0 + (p**?),...) 


is nilpotent, so it belongs to J. But x* # 0, a contradiction. So / is not 
nilpotent. 


Problems 
1. Show that if A and B are nilpotent ideals, their sum A + B is 
nilpotent. 


2. If R is a commutative ring, show that the sum of nil ideals 
A; ,AQ,...,A, is a nil ideal. 

3. Show that the n X n matrix ring F, over a field F has no nonzero 
nil right or left ideals. 


6 Zorn’s lemma 


A partially ordered set (also called a poset) is a system consisting of a 
nonempty set S and a relation, usually denoted by s, such that the follow- 
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ing conditions are satisfied: For all a,b,c € S, 


(i) asbandbsa>a= b (antisymmetric). 
(ii) asa (reflexive). 
(iii) as band bsc—a sc (transitive). 


A chain C in a poset (S,S) is a subset of S such that, for every a,b € C, 
eithera s borb sa. Anelement u € Sisan upper bound of Cifa s ufor 
every a € C; an element m € S is a maximal element of a poset (S,s) if 
msa,aé€ 5S, implies m = a. 

The following axiom from set theory about posets is widely used in 
mathematics. 


Zorn’s lemma. [f every chain C in a poset (S,s) has an upper bound in S, 
then (S,s) has a maximal element. 


Asasimple example of the use of Zorn’s lemma, we prove the following 
result. 


6.1 Theorem. If R is a nonzero ring with unity 1, and Lis an ideal in R 
such that I #R, then there exists a maximal ideal M of R such that 
ICM. 


Proof. Let S be the set of all ideals in R such that if X eS then X 4 R and 
X 21.(S, ©)1s a partially ordered set under inclusion. We assert that the 
union U of an arbitrary chain C in S is an element of S. For if a,b € U, 
then there exist elements 4 and B of S such that a € A and DE B. But 
since C isachain, we have A C Bor B C A, and, accordingly, a — b & Bor 
a-—b€A. Also, rae B or ra€ A for all rE R. In any case, a— bE U 
and ra € U. Similarly, ar € U. Hence, U is an ideal in R. If U = R, then 
1€ U, so 1 © X for some X € C, which ts not true. Hence, U # R and, 
thus, UE S. Then by Zorn’s lemma S contains a maximal member M. 
We claim that M is a maximal ideal in R. So let M ¢ N, where Nisan ideal 
in R. If N # R, then clearly N € S, acontradiction to the maximality of M 
in S. Hence, VN = R, and thus M is a maximal ideal in R, as desired. O 


Problem 


Use Zorn’s lemma to show that every proper right ideal A in a ring 
R with unity is contained in a maximal right ideal 7. (M 1s called 
a maximal right ideal if M # R, and if M ¢ N, for some night ideal 
N, implies N = M or N= R.) 


CHAPTER I!1 


Unique factorization domains and 
euclidean domains 


] Unique factorization domains 


Throughout this chapter R is a commutative integral domain with unity. 
Such a ring is also called a domain. 

If a and b are nonzero elements in R, we say that b divides a(or bisa 
divisor of a) and that ais divisible by b(or a isa multiple of b) if there exists 
in R an element c such that a = bc, and we write bja or a = 0 (mod 5B). 
Clearly, an element u € R is a unit if and only if u is a divisor of 1. 

Two elements a,b in R are called associates if there existsaunituE R 
such that a = bu. Of course, then b = av, where v = u~!. This means that 
if a and b are associates, then a|b and bja. In fact, if R is a commutative 
integral domain, then the converse is also true; that is, if a|b and dja, then 
aand bare associates. For let b = ax and a = by. Then b = byx. Because 
R is an integral domain with I, this gives yx = 1. Therefore x and y are 
units, and, hence, a and 0 are associates. 

We call an element bin R an improper divisor of an element a € R ifb 
is either a unit or an associate of a. 


Definition. A nonzero element a of an integral domain R with unity is 
called an irreducible element if (i) it is not a unit, and (ii) every divisor ofa 
is improper; that is, a= bc, b,cE R, implies either b orc is a unit. 


Definition. A nonzero element p of an integral domain R with unity is 


called a prime element if (i) it is not a unit, and (ii) if p\ab, then p\a or p\b, 
abe R. 
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It is quite easy to verify that a prime element is an irreducible element. 
But an irreducible element need not be prime (see Problem 5). However, 
we have the following 


1.1 Theorem. An irreducible element in a commutative principal 
ideal domain (PID) is always prime. 


Proof. Let R be a PID, and let p € R be an irreducible element. Suppose 
p\ab, a,b € R. Assume p } a. Because R is a PID, there exists c € R such 
that 


PR+aR=cR. 


Then p € cR, which implies p = cd, d € R. But since pis irreducible, c or 
d must be a unit. Suppose dis a unit. Then pR = cR; so pR + aR = pR. 
This gives a € pR, which contradicts the assumption that p J a. Hence, c 
is a unit; therefore, cR = R, so pR + aR = R. Then there exists x.yE R 
such that px + ay = 1. This implies pbx + aby = b. Hence, p|b because 


pijab. O 


We now define an important class of integral domains that contains the 
class of PIDs, as shown later. 


Definition. A commutative integral domain R with unity is called a unique 
factorization domain (or briefly, a UFD) if it satisfies the following condi- 
tions: 
(i) Every nonunit of R is a finite product of irreducible factors. 
(ii) Every irreducible element is prime. 


The factorization into irreducible elements is unique, as will be shown 
in Theorem 1.3. 


1.2 Examples of UFDs 


(a) The ring of integers Z is a UFD. 

(b) The polynomial ring F[x] over a field F is a UFD. 

(c) Each commutative principal ideal ring with unity that is also an 
integral domain [called a principal ideal domain (PID)] is a UFD (see 
Theorem 2.1). 

(d) The commutative integral domain R = (a + bV—5ja,b € Z) is not 
a UFD. for let r = a + 6b V—5 € R. Define a function N: R — Zsuch that 
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N(r) = rr= a? + 562; Mr) is called the norm of r and has the properties 
that (i) M(r) = 0, (11) M(r) = 0 iff r = 0, (111) Mrs) = N(r)Ms). 

It follows from property (iii) that if rs = 1, then Mr)NM(s) = Mrs) = 
N1)= 1. Thus, Mr) = 1, so a2 + 56? =1. This gives a= +1, b=0. 
Hence, the only units are +1. In particular, the only associates of an 
element are the element itself and its negative. 

Now consider the factorizations 9 = 3 - 3 = (2 + V—5) (2— V—5). We 
claim that 3, 2 ++ V—5, and 2— J—5 are all irreducible. Let 2 + V—5 = 
rs. Then M(2 + V—5) = N(r)Ns). This gives 9 = N(r)N(s). Hence, Nr) = 
1, 3, or 9. But if Mr) = 3 and r= a + bv—5, then a? + 5b? = 3, which is 
impossible for integers a and b. Therefore, either M(r) = 1 or Mr) = 9 and 
N(s) = 1. In the former case r is a unit, and in the latter case s is a unit. So 
2 + V—5 is irreducible. Similarly, 2— /—5 and 3 are irreducible. Thus, R 
is not a UFD in view of the next theorem. 


1.3 Theorem. J//R is a UFD, then the factorization ofany element in R 
as a finite product of irreducible factors is unique to within order and unit 
factors. 


Proof. More explicitly, the theorem states that if a= p,p.°':p,= 
9:92 °°* Gn, where p, and q, are irreducible, then m = n, and on renumber- 
ing the g,, we have that p, and q, are associates, i = 1,2,...,m. 

Because the theorem is obvious for factorizations of irreducible ele- 
ments, we assume it is true for any element of R that can be factored into s 
irreducible factors. We then prove it is true for any element that can be 
factored into s + | irreducible factors. Let 


s+ m 
a= [Tam [2 (1) 

i= j= 
be two factorizations of a into irreducible factors, one of which involves 
exactly s + | factors. We have that p; divides the product of the p;’s, and, 
hence, by axiom (ii) for a UFD, p, must divide one of the elements 
DP; P2>---sPj,. Let p, divide py. Because pj is irreducible, p, and p; are 
associates. Then p; = up,, where u is a unit, and after cancellation of the 
common factor p,, (1) gives 


s+ 


[] = EL ay (2) 


j=l 

jek 
Hence, by the induction hypothesis, the two factorizations in (2) can differ 
only in the order of the factors and by unit factors. Because we already 
know that p, and p; differ by a unit factor, the proof is complete. O 
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In the next theorem we show that ina UFD any pair of elements aand b 
has a greatest common divisor (g.c.d) defined as follows. 


Definition. An element d in an integral domain R is called a greatest 
common divisor of elements a and b in R if 


(i) dla and d\b, and 
(it) ifforcE R, cla, and c\b, then c\d. 


1.4 Theorem. Let R be UFD, and a,b € R. Then there exists a great- 
est common divisor of a and b that is uniquely determined to within an 
arbitrary unit factor. 


Proof. Let us write 
a= pi pe pir, b= ph pz ph, 
where p, are irreducible, e,, f, are nonnegative integers, and by p? we mean 


a unit. Set g,= min(e,, f;), (= 1,2,....m, and d= p*' pP ++: p&". Then 
clearly, dja and d|b. Let c € R be such that cla and c|b. Then cla implies 
C= py! Pp? *** Din 

where /,, i = 1,2,...,m, are nonnegative integers. But then cia and c|b 
imply, by Theorem 1.3, that 1, se, and A, s/,, i= 1,2,...,m. Hence, 
A, = min(e,, f,;) = g;. This proves cd, as desired. 

Now suppose d and d’ are the two greatest common divisors of a and b. 
Then d|d’ and d’\d, and, therefore, they are associates because R is a 
commutative integral domain. O 


The uniquely determined (within an arbitrary unit factor) greatest 
common divisor of a and 5 is denoted by (a,0). 

Note that (a,b) is a set in which every two elements are associates. We 
write (a,b) = c to mean that (a,b) consists of all unit multiples of c. 


Definition. In a UFD two elements a and b are called relatively prime if 
(a,b) = 1. 
Problems 


In the following problems R is a commutative integral domain with unity 
in which for each pair a, beR, g.c.d. (a, b) exists. Let a,b,ceR. 


1. Show that c(a,b) and (ca,cb) are associates. 
2. Show that if (a,b) = 1 and if ajc and bjc, then adjc. 


216 Unique factorization domains and euclidean domains 


Show that if (a,b) = | and bjac, then dlc. 

Show that if (a,b) = 1 and (a,c) = 1, then (a,bc) = |. 

Prove that 2 + V—5 is irreducible but not prime in Z[V— 5]. 
Show that 3 is irreducible but not prime in the ring Z[,/ — 5]. 
Show that in the ring Z[,/ —3], (1+./ —3, 1—./—3)=1. 
Show that in the ring Z[./ — 3] the g.c.d. of 4 and 2 + 2,/—3 
does not exist. 

Find the g.c.d., if it exists, of 10+ 1li and 8 +i in Z [i]. 
Show that there exist irreducible elements which are not prime 


in each of the rings Z[./— — and Z[,/10]. 
11. Let Q[./-—3]= faa a,beéZ, and a,b are both even 


or both oda} Show that the units of Q[,/—3] are +1, 


tit./—-3 
ars, eae 


CONN WMWAaWwW 


S 


2 Principal ideal domains 


Recall that a commutative integral domain R with unity is a principal 
ideal domain (PID) if each ideal in R is of the form (a) = aR, aE R. 


2.1 Theorem. Every PID is a UFD, but a UFD is not necessarily a 
PID. 


Proof. First we show that if R is a principal ideal ring, then R cannot have 
any infinite properly ascending chain of ideals. Therefore, let a,RC 
a,RCa,RC +: be a chain of ideals in R. Let A = Ua,R and a,b € A, 
ré R, Then a€a;R, b€ aR for some i,j. Because either a,R C a,R or 
a,R C a,R, both a and 5b lie in one of the two ideals a,R, a,R, say, in a,R. 
Then a— b€ a,RC A. Also, ar € a,R C A. Hence, A is an ideal in R. 
Because R is a principal ideal ring, A = aR for some a € R. Nowa E A 
implies a € a,R for some k. Further, A = aR C a, RC A gives that A = 
aR = a,R; hence, a,R = a,,.,R = ---, proving our assertion. 

Next, we show that each element a € Risa finite product of irreducible 
elements. If ais irreducible, we are done. So let a = bc, where neither b nor 
cis a unit. If both 5 and c are products of irreducible elements, we are 
done. So let 5 not be a product of irreducible elements, and write b = xy, 
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where x, say, is not a product of irreducible elements. This process leads to 
a properly ascending chain of ideals (a) C (b) C(x)C --- that will con- 
tinue indefinitely if a is not a finite product of irreducible elements. But 
since R cannot possess any infinite properly ascending chain of ideals, we 
conclude that a must be a finite product of irreducible elements. 

To complete the proof that R is a UFD, we need to show that if pjab, 
where p is an irreducible element in R, and a,b € R, then pja or p|b. This 
has been shown in Theorem 1.1. 

Finally, we give an example to show that a UFD need not be a PID. 
Consider the polynomial ring R = F [x,y] over a field F in variables x and 
y. The ideal J = (x) + (y) in F[x,y] cannot be of the form (/(x,y)) for any 
polynomial f(x,y) € F [x,y], because 


(x) + (y) = (S(~%y)) => x = f(xy), y = Uf(%) 


for some c,d (#0) in F. This gives x/c = y/d; that is, dx — cy = 0, which is 
absurd because x and y are independent variables over F. Thus, F'[x,)) is 
not a PID. But F[x,y] is a UFD (Theorem 4.3, proved in Section 4). 0 


3 Euclidean domains 


An important class of unique factorization domains is given by euclidean 
domains or rings admitting a division algorithm. These rings are defined 
as follows: 


Definition. A commutative integral domain E with unity is called a euclid- 
ean domain if there exists a function @: E — Z satisfying the following 
axioms: 


(i) Ifabe E*= E — {0) and bla, then d(b) s P(a). 
(it) For each pair of elements a,b € E, b #0, there exist elements q 
and rin E such that a= bq +r, with d(r) < G(b). 


3.1 Examples of euclidean domains 


(a) The ring of integers Z is a euclidean domain if we set O(n) = |n|,n € Z. 
Then for any two integers a and 5 (¥ 0), the ordinary division algorithm 
yields integers g (= quotient) and r (= remainder) satisfying axiom (11). 
Axiom (i) is clear. 

(b) The polynomial ring F [x] over a field Fina variable x is a euclidean 
domain if, for any nonzero polynomial f= f(x) € F[x], we set O(/) = 
degree of fand ¢(0) = — 1. 

(c) The ring of Gaussian integers R = (m+ nV—1|m,n € Z) is a eu- 
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clidean domain if we set @(m + nV— 1) = m? + n2, Note that forx,y € R, 
we have d(xy) = d(x) d(y), and thus dja implies @(b) s ¢(a), which 
proves (i). 

To prove (ii), letO0 # b € Rand write ab-! = a + BV—1, where aand B 
are rational numbers. Choose integers @ and fy such that Ja — ao| s 4, 
|B — Bol = 4. Then 

a=b(at+ BV—1) 
= b((a& — a9) + &% + (B — Bo) V—1 + Byv—1 
= b(a + BoV—1) + b(a@ — &) + b(B — Bo) V—1. 
Because ay By © Z, g = Hy + By V—1 E R. Thus, a— b(ao + By V—1) € 
R. Hence, r = b(a — ay) + b(B — By) V—1 © R. We can then write a = 
bq + r, where g,r © R. Further 


P(r) = d(b)[(a@ — ao¥ + (B — Bo¥) = $ Hd) < (Dd). 


Hence, R is a euclidean domain. 


3.2 Theorem. Every euclidean domain is a PID. 


Proof. Let A be a nonzero ideal in a euclidean domain R. Because for all 
a€A, lla, then d(a) = d(1). Then the set (¢(a)|0 * a E A) is a non- 
empty set of integers with @(1) as a lower bound. So by the principle of 
well-ordering of integers there exists an element d € A such that @(d@) is 
the smallest in this set. We claim A =(d). If a€ A, write a= qd +r, 
gr & R and G(r) < d(d). But since r= a — gdE A and G(r) = G(d) by 
the choice of d, r must be zero. Hence, a€(d). O 


3.3 Theorem. Every euclidean domain is a UFD. 


Proof. Follows from Theorems 2.1 and 3.2. O 


Remark. There exists an interesting class of principal ideal domains that 
are not euclidean domains. Consider, for example, 


Z[{V— 19] = (a + bV—19ja,b € Zand a,b have the same parity). 


(a and b have the same parity if both are odd or both are even.) It requires 
quite tedious computations to show that Z{[V— 19] is a PID but not a 
euclidean domain. The interested reader is referred to J. C. Wilson, A 
principal ideal ring that is not a Euclidean ring, Math. Mag. 46, 34-8 
(1973). 
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Problems 


I. 


‘© 90 


10. 


4 


Let R be a euclidean domain. Prove the following: 

(a) If b #0, then (0) < G(d). 

(b) Ifaand dare associates, then d(a) = (bd). 

(c) If aljb and ¢(a) = ¢(b), then a and b are associates. 


Show that each of the rings Z[./2] and Z[./ —2] is a (i) eucli- 
dean domain and (ii) UFD. Explain why in the UFD Z(./2], 
(5+ ./2) (2—./2)=(11 — 7,/2)(2 + /2) even though each of 
the factors is irreducible. 

Let R be a commutative integral domain with unity that is not a 

field; show that the polynomial ring R[x] in a variable x is not a 

PID. 

Let a= 3 + 2iand b = 2 — 3i be two elements in Z[i]. Find g and 
rin Z[i] such that a= bq + rand g(r) < ¢(b), where g(x + iy) = 

x? + y?. 

Prove that every nonzero prime ideal in a euclidean domain is 
maximal. 

Prove that if D is a domain that is not a field, then D[x] is not a 
euclidean domain. 

Prove that in any PID every ideal is a unique product of prime 
ideals 

Show that Z[./ —6] is not a euclidean domain. 

Let R be a euclidean domain. Show that aeéR is a unit if and 
only if p(a) = p(1). Conclude that the only units of Z[i] are +1, 
+i. 

An ideal P in a commutative ring R is called primary if for all 
x, yER, xyeP, x¢P implies y"eP for some positive integer n. 
Show that if P is a primary ideal then 


/P = {xeR|x"eP for some n > 0} 


is a prime ideal. (./P is called the radical of P.) 

Let R be a PID. Show each primary ideal P is of the form (p‘°), 
e > 0, for some prime element p. Give an example to show that 
this may not be true if R is not a PID. [Consider the ideal (x?, y) 


in F[x, y].] 


Polynomial rings over UFD 


Recall that the ring of polynomials R[x] over a ring R in x (called a 
variable or an indeterminate) is the set of formal expressions a, + 


220 Unique factorization domains and euclidean domains 


a,x+ +--+ +a,x", a,;ER, or, equivalently, the set of sequences 
(@o,4;,2,...), a; © R, where all but a finite number of a,’s are zero. 

If f= (a,) isa nonzero polynomial and nis the greatest integer such that 
a,, ¥ 0, then n is called the degree of f, and a, is called the leading coeffi- 
cient of f. Also, if 1 € Randa, = 1, then fis called monic; in other words, 
a polynomial fis monic if its leading coefficient is |. The degree of a zero 
polynomial is generally defined to be — ~. 

The natural mapping a +> (a,0,0,...) is an isomorphism of R into R[x]. 
Hence, R can be embedded in R[x]. We shall regard R as a subring of R[x] 
by identifying a with (a,0,0....). 

Let R bea ring, and let S = R[x] bea polynomial ring over R. If we start 
with S and construct the polynomial ring S[y] over S in the indetermi- 
nate, y, for example, then S[y] is called a polynomial ring in two indeter- 
minates or variables x,y over R. We write this ring as R[x,y]. It follows 
from the definition that R[x,y] = R[ y,x]. Also, a typical element of R[x,y] 
is of the form 


m n 

> > ayx'y, ai E R, 

i=0 j=0 
where a,, x'/ denotes aX‘ if = 0, ao; y/ if i = 0, and doy if ‘and jare both 
zero. Inductively, we can define a polynomial ring in a finite number of 
variables. 

The polynomial rings over a commutative integral domain have a 

division algorithm, which is given in the following 


4.1 Theorem. Let R = F [x] bea polynomial ring over a commutative 
integral domain F. Let f(x) and g(x) #0 be polynomials in F[x] of degrees 
mand n, respectively. Letk = max(m — n+ 1,0), and let a be the leading 
coefficient of g(x). Then there exist unique polynomials q(x) and r(x) in 
F [x] such that 


a*f(x) = g(x)g(x) + r(x), 
where r(x) = 0 or r(x) has degree less than the degree of g(x). 


Proof. If m < n, we take q(x) = 0 and r(x) = f(x). So let m> nandk= 
m—n-+ 1. We prove the theorem by induction on m. We assume it is 
true for all polynomials of degree < m, and we prove it for polynomials of 
degree m. 

Now the polynomial af(x) — bx™~"g(x) has degree at most m — 1, 
where b is the leading coefficient of f° By the induction hypothesis there 
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exist polynomials g, (x) and r,(x) such that 

air" D—MF! (af(x) — bx™"g(x)) = Gy (x)g(x) + 1; (x). 
Then 

a* f(x) = (ba"—"x™—" + gy (x))g(x) + r(x), 
as desired. Uniqueness follows immediately. O 


Now our aim is to show that polynomial rings over a UFD are also 
UFD. Because a polynomial ring F [x] over a field F, which is trivially a 
UFD, is also a UFD [Example 3.1(b) and Theorem 3.3], our next theorem 
may be regarded as a generalization of this fact. 

We first need some preliminaries. 


Definition. Let R be a UFD. Then f(x) € R[x] is called primitive if the 
g.c.d. of its coefficients is a unit. 


We observe that it is possible to write any nonzero polynomial f 
S(x) € R[x] in the form f= cf,, where c € R and f(x) is primitive, by 
choosing c to be equal to the g.c.d. of the coefficients of f(x). Any element c 
satisfying the condition f= cf,, where /, is primitive, is necessarily a g.c.d. 
of the coefficients of f(x); therefore, cis determined to within a unit factor. 
The factor c is called the content of f(x) and is denoted by c( /). Note that 
f(x) is primitive if and only if c(/) is a unit. 

In what follows we assume that R is a UFD. 


4.2 Lemma (Gauss). /f f(x), g(x) € R[x], then c( fg) = c(f)c(g). In 
particular, the product of two primitive polynomials is primitive. 


Proof. Writing c = c(/) and d = c(g), we have f(x) = cf/,(x) and g(x) = 
dg,(x), where f/, and g, are primitive. Because fg = (cd) f,g,), we need 
only prove that /,g, is primitive. Suppose fg, is not primitive and let p be 
an irreducible element of R that divides all the coefficients of /,g,. If 
S,\ (x) = 2 a,x! and g,(x) = = b,x’, a,, b;E R, let a, and 5, be the first 
coefficients of f, and g,, respectively, that are not divisible by p (the 
existence of a, and 5, follows from the pnmitivity of /, and g, ). Now the 
coefficient of x‘*' in f, (x)g,(x) is 


mise + a, 0,4 + a,b, + Q5410,-) + cae 
Because R isa UFD, p / a, 6,. Because p divides all terms of the above sum 
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that precede and follow a,b,, it does not divide the sum itself, a contradic- 
tion. Hence, /,(x)g,(x) is primitive. O 


We now prove 


43 Theorem. Let R be a unique factorization domain. Then the poly- 
nomial ring R[x] over R is also a unique factorization domain. 


Proof. We first show that each nonunit element of R[x] factors into irre- 
ducible ones. Without any loss of generality, it suffices to prove the 
theorem for polynomials f(x) € R[x] that are primitive. We prove the 
result by induction on the degree ofa polynomial. If the degree of f(x) = 1, 
then the result clearly holds because R is a UFD. Now assume that the 
degree of f(x) = n> 1, and the result is true for all polynomials of degree 
<n. If f(x) is irreducible over R, we are done. Otherwise, write /(x) = 
Si (x)f2(x), where f, (x) and f,(x) are nonconstant polynomials over R of 
degrees n, and n,, respectively. Thus, 7, <n and n, <n; therefore, the 
result follows by induction. 

We now show that if p(x), f(x), g(x) € R[x] and p(x) is an irreducible 
element in R[x] such that p(x)| /(x)g(x), then p(x)|/(x) or p(x)|g(x). If the 
degree of p(x) is zero, then p(x) = a € R. Then 


P(x)If(x)g (x) = ah(x) = f(xdg(x), A(X) € Ri]. 


This implies ac(h) = c(f)c(g). But since a is irreducible, this yields 
alc( /) or aic(g). Hence, a| f(x) or alg(x). Assume that the degree of p(x) is 
positive, and let p(x) not divide f(x). Consider the ideal S generated by 
f(x) and p(x) in R[x]. The elements of S are of the form A(x)p(x) + 
B(x) f(x), where A(x), B(x) € R[x]. Let 0 * d(x) € S be of the smallest 
degree, and let a be its leading coefficient. By Theorem 4.1 there exists a 
nonnegative integer k and polynomials h(x) and r(x) such that a*/(x) = 
(x)h(x) + r(x), where either r(x) = 0 or the degree of r(x) is less than that 
of @(x). This implies r(x) € S, a contradiction unless r(x) = 0; so 


a* f(x) = h(x)h(x) = c(h) gh, (x)A(x), 


where @,(x) is primitive. Now @,(x) divides a*f(x) implies },(x)t(x) = 
a*f(x), which yields c(t) = a*c(/). Therefore, a*|c(t) and, hence, a*|t(x). 
This, along with @,(x)t(x) = a*f(x) and the fact that R[x] is an integral 
domain, yields @,(x) divides f(x). Similarly, @,(x) divides ‘p(x). Because 
p(x) is irreducible and does not divide f(x), it follows that @,(x) is a unit in 
R[x]. Hence, ¢,(x) € R. Thus, 6(x) = c(h), (x) € R; so (x) = aE R, 
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and we can write 
a = A(x)p(x) + B(x) f(x) 


for some A(x), B(x) in R[x]. Then by multiplying by g(x) on both sides of 
the above equation, we get 


ag(x) = A(x)p(x)g(x) + B(x) f(x)g(x). 


This yields p(x)jag(x), since, by hypothesis, p(x)|/(x)g(x). Thus, 
ag(x) = p(x)t(x), t(x) € R[x]. By equating the contents of the polyno- 
mials on both sides and using Lemma 4.2, we obtain 


ac(g) = c(p)c(t) = c(t), 

because p(x) is irreducible. Thus, a{c(t) and therefore ajt(x). Hence, 
B(x) = p(x)t’(x), —t’(x) € R[x). 

This proves that p(x)|g(x), as desired. O 


Remark. It follows immediately from Theorem 4.3 that if R is a UFD, 
then the polynomial ring R[x, ,x,...,x,] over R in a finite number of 
variables x;, i = 1,2,...,n, is also a UFD. 


Problems 


1. Show that the polynomial ring F [x,y] in two variables over a field 
F is a UFD but not a PID. 

2. Let F[x] be polynomial ring over a field F. Show that a nonzero 
polynomial f(x) € F [x] is a unit if and only if f(x) € F. 

3. Let R be a commutative ring with unity. Show that an element 
f(x) € R[x] is a zero divisor if and only if there exists an element 
0 # 6E R such that bf(x) = 0. 

4. Show that the n X n matrix ring (R[x]),, over a polynomial ring 
R[x] is isomorphic to the polynomial ring R,,[x] over the n X n 
matrix ring R,. 


CHAPTER 12 


Rings of fractions 


Let R be a commutative ring containing regular elements; that is, ele- 
ments a € R such that a # 0 and ais not a zero divisor. In this chapter we 
show that any commutative ring R with regular elements can be embed- 
ded in a ring Qwith unity such that every regular element of R is invertible 
in Q. In particular, any integral domain can be embedded in a field. 
Indeed, by defining the general notion of ring of fractions with respect to a 
multiplicative subset S, we obtain a ring R, such that there is a canonical 
homomorphism from R to R,. The conditions under which a noncom- 
mutative integral domain can be embedded in a division ring are also 
discussed. 


] Rings of fractions 


Definition. A nonempty subset S of a ring R is called a multiplicative set if 
for all s;,5,€ S, we have s,s,€ S. If, in addition, each element of S is 
regular, then S is called a regular multiplicative set. 


Clearly, the set of all regular elements is a regular multiplicative set. In 
particular, if R is an integral domain, then S = R — {0} is a regular multi- 
plicative set. 

Let R be a commutative ring and S a multiplicative set. Define a 
relation ~ on RXS by (r,s) ~ (r’,s’) if there exists s’’ € S such that 
s(rs’ — r’s) =0. 
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1.1 Lemma, ~ is an equivalence relation. 
Proof. The proof follows by routine verification. O 


Let a/s denote the equivalence class determined by (a,s), and let Rs 
denote the set of all equivalence classes. Our purpose is to make R, into a 
ring by defining addition and multiplication in a natural manner, remi- 
niscent of addition and multiplication of “fractions” in anthmetic. We 
define 


a/s, + b/s, = (as, + bS,)/S, 52, 
a/s,:b/s, = ab/s, Sy. 


1.2 Theorem. (R5,+,:) is @ ring with unity. 


Proof. It is routine to verify that addition and multiplication as defined 
preceding the theorem are well defined and that (R,,+,-) is a ring. The 
reader is encouraged to verify these facts for the sake of getting familiar 
with the notion. It may be noted that the zero element of R, is 0/s for any 
s€ S, and s/s is the unity foranysE S. O 


The ring R, is called the ring of fractions of R with respect to S or the 
localizationof Rat S; Rsisalso called the quotient ring of R with respect to 
S. (Warning: The quotient ring of a ring R modulo an ideal /, that is, R//, 
is altogether a different notion, as defined in Chapter 10.) If S consists of 
all the regular elements of R, then R,is usually known as the total quotient 
ring of R. Finally, if R, is a ring of fractions where 0 € S, then R, is the 
zero ring. Therefore, we assume throughout this section that 0 ¢ S. 


1.3 Theorem. Let R be a commutative ring, and let S be a multiplica- 
tive subset of R. Then there exists a natural homomorphism f; R— Rs 
defined by f(a) = as/s, where aE R, ands ES is some fixed element. 
Further, f is a monomorphism iff xa = 0, x E S, aE R implies a = 0. 


Proof. Let a,b € R. Then 


f(a + b) = (a+ b)s/s = as/s + bs/s = f(a) + f(b), 
(ab) = abs/s = as/s bs/s = f(a) f(b). 


Hence, fis a homomorphism. Next, 


a & Ker fe as/s = 0/s = t(as?—Os)™=0 #£forsometEeS 
= ats?=Q0eax=0 #$£forsomexeEsS. 
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Therefore, Ker f= (0) iff ax = 0 impliesa=0. O 


As a consequence of Theorem 1.3, we have 


1.4 Theorem. Let R be a commutative ring with regular elements. Let 
S be the set ofall regular elements in R. Then the ring Rs has the following 
properties. 


(i) Ris embeddable in Rs. 
Regarding R as a subring of Rs, we have 


(ii) each regular element of R is invertible in Rs, and 
(iii) each element of Rs is of the form as“', whereaE R,sES. 


Proof. (i) follows from Theorem 1.3. Further, if ais a regular element in R, 
then identifying a with ab/b € Rs, where b issome element in S, it is clear 
that b/ab € Rg, is the inverse of ab/b, because ab € S. Finally, to prove 
(iii), let a/s € Rs. Then a/s = (ab/bXb/bs) = as~'. O 


Definition. Let R be any ring (not necessarily commutative), and let Q bea 

ring with unity containing R as a subring such that each regular element of 
R is invertible in Q and each element of Q is of the form ab~' (a~'b), where 

a€ Rand b is a regular element of R. Then Q is called a right (left) 

quotient ring of R. Jf R is commutative, then Q is called a quotient ring of 
R. 


15 Theorem. Any commutative integral domain R can be embedded 
in a field Rs (called the field of fractions of R). 


Proof. Let S = R — {0}. Then Sisa regular multiplicative set. By Theorem 
1.4, Rs is a field containing R asasubring. O 


Definition. A ring R with unity is called a local ring if it has a unique 
maximal right ideal. 


1.6 Theorem. Let R be a commutative ring and P a prime ideal. Then 
S=R-—P is a multiplicative set, and Rs is a local ring with unique 
maximal ideal Ps = {a/s\a & P, s € P). 


Proof. That Sis a multiplicative set follows from the definition of a prime 
ideal. We first show that P, is an ideal in Ry. Let a,/s,, @3/S_ € Ps 
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and r/sE Rs. Then @,/s, — a2/s, = (a,S, — a2S,)/S,5. & Ps. Further, 
(a,/s,Xr/s) = a,r/s,s © Ps. Hence, Ps is an ideal. To show that P, is 
maximal, let r/s ¢ P,. Then r¢ P. Thus, s/r € Rs, so r/s is invertible. 
This shows that all the elements of R, that lie outside Ps are invertible. 
Hence, P, 1s the unique maximal ideal in R,. This proves that R, isa local 


ring. O 


1.7 Example. Let R be a UFD, and let S be a multiplicative subset of 
R containing the unity of R. Then R, is also a UFD. 


Solution. First we show that if a € R is irreducible in R, then a/1 is 
irreducible in Ry. For, let a/1 = (b/s,\(c/s,), where b/s, and c/s, are 
nonunits in R,. Then as,s, = bc. Because a is irreducible and, hence, 
prime, it follows that a|b or alc. For definiteness let a/b; so ab, = 6 for 
some 6, € R. Then a/1 = (b/s, c/s) => 1 = (6,/s,(c/s,), a contradic- 
tion, because c/s, is not a unit. Now let a/seR,. Write a= a,a,---a,aSa 
product of irreducible elements in R. Then a/s = (1/s)(a, /1)(a2/1)---(a,/1) 
is a product of irreducible elements in Ry. This proves one of the 
conditions for Rs to be a UFD. 

We proceed to prove the second condition, that if a/s € Rg is irreduc- 
ible then it is prime. Now a/s irreducible implies a is irreducible in R, so a 
is prime in R. We prove a/1 is prime in Ry. Let a/1 divide (b/s,)(c/s3). 
Then (a/1\d/s;) = (b/s,(c/s,.) for some d/s,;€ Rs. This implies 
ads, s, = bcs;,s0 a\bcs,. But then a divides b,c, or s;. If a|s3, it follows that 
a/s|1, so a/s is a unit in Rs, a contradiction. Thus, a|b or a|c. This implies 
a/s divides b/s, or c/s, as desired. O 


Problems 
In Problems | -3, S is a multiplicative subset of a commutative ring R 
with unity and | € S. 
1. Letj: R — Rg be the canonical homomorphism given by /(a) = 
as/s for some fixed s € S. If g: R — R’ is any homomorphism of 
a ring R into a ring R’ such that every element g(s), sE S, is a 
unit, then there exists ahomomorphism /: R, — R’ such that the 
following diagram is commutative: 


ES ’ 
R - R 
”” 
s” 
f Oh 
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This is also expressed by saying that g factors through f- 

Let R be a PID. Show that R, is also a PID. 

Let R be a commutative ring with | having no infinite properly 
ascending chain of ideals in R. Show that the same holds for Rs. 
4. Let p be prime. Show that the ring 


Zip) = {a/b\a,b € Z, pyb) 
is a local ring. What is the unique maximal ideal? 

5. Let R(x) and R(x, y) be fields of rational functions over R. Is it 
true that R(x) and R(x, y) are isomorphic fields? 


ate tad 


2 Rings with Ore condition 


We now consider the question of embedding any integral domain in a 
division ring. 


Definition. An integral domain R is called a right (left) Ore domain if for 
any pair of nonzero elements a,b € R there exist nonzero elements x,y € 


R such that ax = by (xa = yb). 


We note that an integral domain R is a right Ore domain if and only if 
every pair of nonzero right ideals has a nonzero intersection. Suppose, 
first, that R is a right Ore domain. Let J and J be nonzero right ideals in R. 
Choose 0 # a € Jand 0 # bE J. Then there exist 0 * x,y € R such that 
ax = by. Since ax = byE IN J, it follows that 7M J # (0). The converse 
is similar. 

We also remark that it follows from the definition that any commuta- 
tive integral domain is a right Ore domain as well as a left Ore domain. 

Let R be an integral domain, and let / be a nonzero right ideal of R. 
Denote by Hom,{/,R) the set of mappings f: J— R such that (1) 
S(x + y) = f(x) + f(y), and (ii) (xr) = f(x)r for all x, y € J, r € R. A map- 
ping with these two properties is known as an R-linear mapping. We note 
that if fis an R-linear mapping, then fis an additive group homomor- 
phism. Thus, f(0) = 0 and f(— x) = — f(x). 

Throughout this section € will denote the set of nonzero right ideals of 
a ring R. 


2.1 Lemma. Let R be a right Ore domain. Then 


(i) € is closed under intersection. 
(ii) If f; I—R is an R-linear mapping, then f-(X)€@ for any 
XE. 
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(Recall that f~\(X) = {ael| f(a)eX}.) 


Proof. (i) Let I, J € @. Then because R is a right Ore domain, JN J # (0); 
solINnJE €. 

(11) Let a,b € f-'(X) and rE R. Then f(a), f(b) € X, so f(a) — f(b) € 
X. Thus a— b€ f~'(X). Also, because X is a right ideal, f(a)r € X. So 
f(ar) € X. This implies ar € f~'(X). This proves that f~'(X) is a right 
ideal. 

By using the hypothesis that R is a right Ore domain, we show that 
S~\(X) # (0). For, if f(/) = 0, then f(/) C X. SoC f-"(X). On the other 
hand, if f(/) # 0, then because R is a right Ore domain, f(/) NX # (0). 
This implies that there exists 0 ¥ a € J such that f(a) € X. Therefore, 
0#aeEf~'(X). This proves that f/—'(X¥)E@. O 


Let R be a right Ore domain and H = U,...Hom(/,R). Define a rela- 
tion ~ on H by f~ gif f= g on some 4 E€ €. 


2.2 Lemma. ~ is an equivalence relation. 


Proof. Clearly, ~ is reflexive and symmetric. We now show ~ is transitive. 
Let f~ gand g ~ h. Then f= gonsomeA & € and g= honsome BE €. 
But then f= honAN BE @. Hence f ~A. This proves that ~ is an equiva- 
lence relation. O 


Denote by [ /] the equivalence class determined by f€ H, and by Qthe 
set H/~ of equivalence classes. Our purpose now is to make Q = H/~ into 
a ring by suitably defining addition and multiplication. In what follows let 
Dom f denote the domain of a mapping f- We define addition in H as 
follows: 


(f] + [8] = (f+ 3], 
where 
f+ g: Dom fN Dom g— R. 


First, note that Dom f{N Dom g € @, so f+ g € H. To show that addition 
is well defined, let [ f] = [/’] and [g] = [g’]. Then f=’ on some A € @ 
and g=g’ on some BE @. This yields f+ g=/f’ +g’ on ANBE &. 
Hence, [ f + g] = [/’ +g’ ]. Also, the addition is commutative and asso- 
Clative, [0] is the identity element in (Q,+) and [— /] is the additive inverse 
of [ f]. Thus, (Q,+) is an additive abelian group. 
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Next, we define multiplication in H as follows: 


[S}[g] = (4a), 


where 
Sg: g~'(Dom f) — R. 

By Lemma 2.1, g~'(Dom /) € @, so fg € H. To show that multiplication 
is well defined, let [ /] = [/’] and [g] = [g’]. Then f=’ on, say, A € @, 
and g = g’on, say, B € @. Weclaim that fg = /’g’ on BN g7 (A) € €. Let 
x € BN g- (A). Then x € B implies g(x) = g(x) and g(x) € A implies 
S(g(x)) = f'(g(x)). Hence, (/eXx) = (f’g’\(x). This proves [ fg] = [/’g’]. 
It is straightforward to check that (Q,:) is a semigroup and that the 
distributive laws 


[S\(Ce) + (A]) = (/ Eg) + (SILA), 
({4] + [s)){4] = (414) + (gf) 


hold. Also, if J: R — Ris the identity mapping, then [/] is the unity of Q. 
Thus, Q is a ring with unity. We now show that Q is a division ring. 


2.3. Theorem. Let R be a right Ore domain. Then Q™= Uj 
Hom(I,R)/~ is a division ring. 


Proof. Let [ f] be a nonzero element in Q. If Ker f* (0), then Ker fe @. 
Now f= the zero mapping on Ker f. Therefore, [ f] = [0], a contradic- 
tion. This gives Ker f= (0). Define g: Im f— R by g(/(x)) = x, x € Dom 
f. The mapping g is well defined because Ker f= (0). But then gf= 
identity mapping / on Dom /; therefore [gf] = [J]. This gives [g][ /] = 
[7], so [/] has a left inverse. This proves that Q is a division ring. O 


2.4 Theorem. Let R be an integral domain. Then R is a right Ore 
domain if and only if there exists a division ring Q such that 


(i) Risa subring of Q. 
(ii) Every element of Q is of the form ab™', for some a,b, € R. 


(Recall that the ring Q with properties (i) and (ii) is called a right quotient 
ring of R.) 


Proof. Let R be a right Ore domain. Then by Theorem 2.3, Q= 
U je~¢Hom(/,R)/~ isa division ring. We assert that the mapping a > [a*] 
of R into Q, where a* € Hom,(R, R), given by a*(x) = ax, isa 1-1 homo- 
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morphism. For, 
(a + b)*(x) = (a + b)x = ax + Ox = a*(x) + b*(x) = (a* + b*Xx), 
so (a + b)* = a* + b*. This implies [(a + 5)*) = [a* + b*) = [a*] + 
[b*). Further, 
(ab)*(x) = (ab)x = a(bx) = a(b*x) = a*(b*x) = (a*b*)\(x). 


Therefore, (ab)* = a*b*, so [(ab)*] = [a*][b*]. To show that the map- 
ping is 1-1, let [a*] = [0]. Then a* =0 on some /€ ©. This implies 
a*] = (0); that is, a/ = 0. But because R is an integral domain, this implies 
a = (0. This proves that the mapping is 1-1. Hence, R can be embedded in 
a division ring Q. We may then identify R with its copy in Q and regard R 
as a subring of Q. This proves (i). We may remark that under this identifi- 
cation ifg € Qanda € R, then g(a) is identified with ga, the composition 
of mappings g and a. For, g(aXb) = q(ab), by definition of an R-linear 
mapping g. Also, considering g and a (=a*) as mappings, (ga\b) = 
(ga*\b) = q(a*b) = q(ab). Therefore, under the stated identification if 
gE Q,aeER, then q(a) = ga. 

To prove (ii), let g € Q. By construction of Q there exists / E € such 
that g(/) C R. LetO * bE J. Then there exists a € R such that q(b) = a; 
that is, gb = a. This implies g = ab~' because every nonzero element in Q 
is invertible. 

Conversely, let a and b be nonzero elements in R. Then a~'b € Q, so 
a~'b= xy~'! for some nonzero elements x,y € R. This implies ax = by. 
Hence, R is a right Ore domain. O 


Problems 


1. IfRisan integral domain with unity in which each night ideal is of 
the form aR, a € R(i.e., Risa right principal ideal domain), show 
that R is a right Ore domain. 

2. Let R be an integral domain satisfying the standard identity of 
degree 3; that is, for every ordered set (a, ,a,,a3) of elements in R, 


> + a;,4,,4;, = 0, 

where the summation runs over every permutation (i, ,/,/3) of 
(1,2,3), and the sign of the corresponding term a, ,@,,q,, iS positive 
or negative according to whether (i, ,i.,/)) is an even permutation 
or an odd permutation. Show that R is a right Ore domain and a 
left Ore domain. 
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Show that an integral domain satisfying a standard identity of 
any degree nis both a right and a left Ore domain. 

Let F be a field, and let R= ((8 °)ja,b € F). Show that (¢ °) 
is regular if and only if a #0, and every regular element in R 
is invertible in R. Conclude that R is its own night and left ring 
of quotients. (We may remark that a right, as well as a left, ring 
of quotients of R= ((g °)ja,b,c € F) is the 2 X 2 matrix ring 
over F.) 


CHAPTER 13 


Integers 


In this chapter our purpose is to establish all the familiar properties of the 
natural numbers (= positive integers) and to obtain the ring of integers by 
starting from the five axioms of Peano. We also demonstrate that the five 
axioms are equivalent to the axioms of an ordered integral domain whose 
positive elements are well-ordered. Either of these sets of axioms deter- 
mines a unique ring (up to isomorphism) called the ring of integers. 


| Peano’s axioms 


The traditional method of describing the set N of natural numbers axio- 
matically is by means of the following axioms of Peano: 


(i) IEN. 

(11) Foreach a € N there exists a unique a’ E N called the successor 
of a. (In other words there exists a map a a’ of N into itself, 
called the successor map.) 

(ii) a’ #1 foranyaEN. 

(iv) Foralla,b€N, a’ = 6’ = a= 5. (In other words, the successor 
map a— a’ of N into itself is 1-1.) 

(v) Let Sbeasubset of N such that (a) 1 € S;(b)ifa € S,thena’ E€ S. 
Then S=N. 

The fifth axiom is called the axiom of induction or the first principle of 

induction and is the basis of the proofs of many theorems in mathematics. 

We write 1’ = 2, 2’ = 3, 3’ = 4, and so on. 

We may restate the first principle of induction (also called the principle 

of mathematical induction) as follows: 
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(v’) Suppose that for each natural number n € N we have associated a 
statement S(n) such that 

(a) S(1) 1s true. 

(b) S(n) is true implies S(n’) is also true. 
Then S() is true for all n EN. 

The following are some of the immediate consequences of Peano’s 
axioms. 


1.1 Corollary. For anyaEN, a’ # a. 


Proof. Let S= {nE N|n’ # n). By axioms (i) and (iii), 1 E S. Letn ES. 
Then n’ # n; so (n’)’ # n’, for by axiom (iv), (n’)’ =n’ implies n’ = n. 
Consequently, n’ € S. Axiom (v) then givesS=N. O 


1.2 Corollary. Foranya EN, a ¥ 1, there exists auniqueb € N such 
that a= b’. 


Proof. Let S = {n € N|jn = | orn = m’ forsome m € N). Then by defini- 
tion of S, 1 € S. Letn € S. Then n = m’ for some mE N, son’ =(m’)’. 
Thus n’ € S. Hence, by axiom (v), S = N. Further, if a = 5’ anda =c’, 
then by axiom (iv), b=c. O 


Addition and multiplication of natural numbers can be defined in a 
“recursive manner.” The next theorem shows the existence and unique- 
ness of a binary operation in N (called plus) satisfying certain conditions. 


13 Theorem. There exists one and only one binary operation “+” in 
N satisfying 
(i) atl=d, 
(ii) a+b’ =(a+t by 


forallabEN. 


Proof. We first show that there exists a binary operation “+” in N satisfy- 
ing (i) and (ii). 
Consider the set 
S = (a € Nia + b can be defined for all b in N 
satisfying conditions (i) and (ii)). 
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To show that | € S, we define 
1+b=5)’ for each DEN. 


Then | + 1 = 1’ by our definition, and 1 + b’ = (b’)’ = (1 + 5b)’ by re- 
peatedly using our definition. Thus, (i) and (ii) are satisfied for the case 
a=Ii;soleEsS. 

Let ae S. Hence, a + b is defined for all b € N. We now define 


a’+b=(at by’. 
Then 
a’+1=(at 1) (by definition) 
= (q’)’ (since a€ S). 
Also, 


a’ +b’ =(a+t+0b’) (by definition) 
= ((a + b)’)’ (since aE S) 
=(a’ + by’ (by definition). 


So a’ € S. Hence, by axiom (v), S=N. 
We now prove that there is exactly one binary operation in N satisfying 
(i) and (11). 
Suppose © is another binary operation in N satisfying (i) and (ii). For 
any fixed a € N let 
S=(nENilatn=a@®n). 
Then | € S because 
a+1=a’ [by (i)] 
=q@® 1 [by (1)}. 
Next, let 2 € S. Then 
atn’=(a+t+n) [by (11)] 
= (qa @ n)’ (since nE S) 
=q@®n’ [by (1i)]. 
Son’ € S. Hence, by axiom (v), S = N. This proves that “+” and “©” are 
the same binary operations. O 


1.4 Theorem. The binary operation + in N satisfies the following 
laws: 


(i) Foralla,b,cEN, 
(a+b)+c=at(b+c) (associative law of addition). 
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(ii) ForallabEN, 
a+b=b+a (commutative law of addition). 


Proof. (i) Let a and b be fixed (but arbitrary) natural numbers, and let 
S={cENi{(at+ b)+c=at(b+0)). 

Then by definition of +, 
(a+ b)+1=(at+ b/ =at+b’=—=at+(b+ 1). 

Thus, 1 € S. Now suppose c € S. Then 


(a+ b)+c’ =((a+ b) +c) (by definition of +) 
=(a+(b+c))’ (since cE S$) 
=q+(b+c) (by definition of +) 
=a+(b+c’) (again by definition of +). 


Thus, c’ € S. Therefore, by axiom (v), it follows that S = N. This proves 
the associative law of addition. 
The proof of (ii) is similar and is left to the reader. O 


The next theorem is the fundamental theorem for defining ordering in 
the set of natural numbers. 


1.5 Theorem. Let a,b E N. Then exactly one of the following state- 
ments holds: 


(i) a=b. 
(ii) a=b+uforsomeuEN. 
(iii) b=atv/forsomevEN. 


Proof. First we show that (i) and (ii) cannot simultaneously hold. Ifa = 5 
anda=b+u, then a=a+tu. We show that this is not possible. Con- 
sider the set S= (nENi[n #n+ uj. Now lt+tu=ut+l1=u’ by the 
properties of the binary operation +. Further, by axiom (iii), 1 # u’. So 
le S. Let nE S. Then n#n+u. Suppose n’ =n’ +u. Then n’= 
(n + u)’, by definition of +, so by axiom (iv), n = n + u, a contradiction. 
Thus, n’ ¥ n’ + u,son’ E S. Therefore, by axiom (v), S = N. In particu- 
lar, a # a+ _u. This proves that (i) and (11) cannot hold simultaneously. 

Similarly, (i) and (iii) cannot hold simultaneously. 

The proof that (11) and (i1) cannot hold together is a straightforward 
application of the associative and commutative laws of addition. 

We now proceed to show that one of the three statements must hold. 
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Let aE N be a fixed (but arbitrary) number. Let 
T = {b E Nione of the three statements in the theorem is true). 


We assert that | € 7, for either a = | or else if a * 1, then by Corollary 
1.2, 


a=u’ (for some u € N) 
=1+u (by definition of +). 


Hence, a = | ora = 1 + u, showing that | € 7, as asserted. Now assume 
that b € T. We show D’ € T. If b = a, then b’ = a’ = a + 1. So (iii) holds 
and, thus, b’ € 7. Ifb = a+u,then db’ = (a+ uy =a + u’. Soagain (iii) 
holds, and, thus, b’e 7. If a= 6+ u, then we have to consider two 


subcases: 
Subcase (A). u= 1. Then a = 6 + 1 = b’, so (i) holds. Thus, b’ € 7. 
Subcase (B). u¥*1. Then u = v’ for some v € N (Corollary 1.2). Thus, 


a=b+u=—b+0' =b+(14+0) =(6+ 1l)+v=bd' +, 
showing that b’ € 7. Hence, 7 = N, which proves the theorem. O 


1.6 Theorem (cancellation laws for addition) 
(i) atuF¥a. 
(it) atxmtatyrmx=y 

for all au,x,yEN. 


Proof. (i) Follows from the previous theorem. 

(11) Case 1.x = y+ vforsomevE N. Thena+x=at+ yatyt 
v =a+t y, a contradiction by (i). 

Case 2. y= x +0’ for some v’ EN. Then, again, a+ x=at+ y= 
a+x=a+x+v’, a contradiction by (i). Hence, by the previous 
theorem x= y. O 


Definition. Let a,b € N. a is said to be greater than b ifa = b + ufor some 
uEN. We denote this bya> b. 


Definition. Let a,b € N. a is said to be less than b if b = a + v for some 
vEN. We denote this by a < b. 


Theorem 1.5 can be restated as 


1.7 Remark (trichotomy law of natural numbers). Given a,b € N, 
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one and only one of the following statements holds: 


(i) a=b. 
(ii) a>Ob. 
(iii) a<b. 


18 Remark. a > b if and only if 5 < a. 


1.9 Remark. If a # 1, then 1 <a. 


Proof.a ¥ | => 3b € Nsuch that a = b’ by Corollary 1.2. Thena = 1+ 5 
by the properties of the binary operation +. Therefore, by definition of less 
than, 1 <a. O 


The notation @ 2 b shall mean that either a = b or a> b. We give a 
similar meaning for a s b. 


1.10 Theorem. Let a,bEN. Thena <b ifand only ifa+1s b. 
Proof.a<b=>b=at+tu for some uEN. If u=1, then b= a+ 1. If 
u# 1, then, by Corollary 1.2, u =v’ for some v E N. Then 


b=atumb=atv’>b=ati(lt+v) 
=b=(at+l)tv->at+1<b. 


Hence, in any case, a + 1 s 5b. The proof of the converse is obvious. O 


We now prove a basic property of the set of natural numbers that is the 
basis of the second principle of mathematical induction. 


1.11. Theorem (well-ordering property of natural numbers). Every 
nonempty set of natural numbers possesses a least member. 


Proof. Let S be the given set, and let 
T=({nENinsa forall ae€é S}. 


Then by Remark 1.9, 1 € 7. Let n € T. Now, for each a € S, a’ > a, so 
a’ & T. This implies T * N. But then by Peano’s axiom (v), there exists 
t€ T such that t’ ¢ 7. We now claim that ¢€ S and is the required 
element. First, by definition of 7, ts a for all aE S. For if t € S, then 
t<a for all aE S. But then ¢ + 1 s a by Theorem 1.10. This gives t’ = 
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t+ 1 € 7, a contradiction. Therefore, € SandtsaforallaeES. O 


We now prove the second principle of induction. 


1.12. Theorem (second principle of induction). Let S be a subset of N 
such that 


(i) 1ES. 
(ii) n€S whenever mé S for all positive integers m <n. 


Then S=N. 


Proof. Let T=({n€ Nin ¢ S). If T = ©, we are done, so assume T # ©. 
Then by the well-ordering property of N, 7 contains a least member, say ¢. 
By (i), 1 € 7, sot ¥ 1. Thus, by Remark 1.9, 1 < ¢. But then all natural 
numbers less than ¢ belong to S. So by hypothesis (ii), ¢ € S, a contradic- 
tion. Hence, 7 = ©; that is, S=N. O 


We now introduce a second binary operation in N to be called multi- 
plication or product. 


1.13 Theorem. There exists one and only one binary operation: inN 
satisfying 

(i) al=a, 

(ii) ab’=ab+a 
for all a, beN. 


We often write a - b simply as ab, which is called the product of a by 5 or 
the “number obtained from multiplication of a by 5.” 


Proof. The proof is similar to the proof of Theorem 1.3, concerning 
existence and uniqueness of the binary operation +. O 


1.14 Theorem. The binary operation of multiplication inN satisfies the 
following laws: 


(i) ab=ba (commutative law ), 
(ii) (ab)c =a(bc) (associative law), 
(iti) a(b+c)=ab+ac (distributive law), 


for all a,b,ceN. 
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Proof. The proofs of (i) and (ii) are exactly the same as those for the 
corresponding laws for addition. 

(iii) a(b + 1) = ab’ = ab+a=ab+a-: 1, by definition of multipli- 
cation. Assume that for a given c € N, we have a(b + c) = ab + ac for all 
a, beN. We prove a(b + c’)= ab + ac’ for all a,beN. Now 


a(b+c’)=a((b+c)’) —_ (by definition of addition) 
= a(b+c)+a (by definition of multiplication) 
=(ab+ac)+a (by induction hypothesis) 
= ab + (ac + a) (by associative law of addition) 
=ab+ac’ (by definition of multiplication). 


Hence, by the pnnciple of induction, a(b+c)=ab+ac for all 
a,bcoEN. O 


2 Integers 


After having given a systematic development of the system of natural 
numbers N, we now extend this to the set Z of integers. The necessity of 
extending the system of natural numbers arises from the fact that an 
equation of the type a = x + b, where a,b, € N, does not always possess a 


solution in N (see Theorem 1.5). 
Consider an equivalence relation ~ on N X N defined as follows: 


(a,b)~ (c,d) iff atd=bte. 


Clearly, ~ is an equivalence relation on N X N. We denote the equiva- 
lence class of (a,b) by (a,b) and define binary operations + and - called, 
respectively, addition and multiplication, in the set N X N/~ of equiva- 
lence classes by the following rules: 

(a,b) + (c,d) = (a +¢,b + d), 

(a,b\(c,d) = (ac + bd,ad + bc). 
These definitions of addition and multiplication are well defined, as.can 
be checked in a routine manner. We note that (1,1) is the identity element 
for addition, and (1 + 1,1) is the identity element for multiplication. For 
we have 


(a,b) + (1,1) = (a+ 1,5 + 1) = (a,b) 


and 


(a,bX1 + 1,1) =(a(l + 1) +b,a + BU + 1) 
=(a+at+bat+b+ b)=(a,b). 


The set N X N/~ of equivalence classes is denoted by Z. 
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2.1 Theorem. (Z,+,:) is a commutative integral domain with unity. 


Proof. (Z,+) is clearly a commutative semigroup with identity (1, 1,1). Be- 
cause (a,b) + (b,a) = (a + b,a + b) = (1,1), it follows that each element 
of (Z,+) has an additive inverse, which proves that (Z,+) is an abelian 
group. 

Further, (Z, -) isa commutative semigroup with identity (1 + 1,1). The 
two distributive laws follow directly. Hence, (Z,+,:) is a commutative 
ring with unity. 

Let us denote, as usual, the additive identity (1, 1,1)by0.’ To prove Z is an 
integral domain, let x,y € Z with xy = 0. Let x = (a,b) and y = (c,d). 
Then 

xy=O0=—(abXfcod)=(11) 
=> (ac + bd,ad + bc) = (1,1) 
= (ac + bd) + 1 =ad+ bct+ 1. 


Suppose x # 0. Then a # b. So either a= b+ uorb=a+tv. 
Case 1.a=6+ u. Then 


ac + bd+ 1 =ad+ bc+ 1 
=bc+uct+ bd+1 
= bd+ud+ bc+! 
=> (bc + bd + 1) + uc 
= (bd + bc + 1) + ud, 
by commutativity and associativity of addition in N. 


But then by Theorem 1.6(ii), uc = ud. Incase c # d, theneitherc=d+s 
ord=c+tifc=dt+ts, then u(d +s) = ud = ud + us = ud, acontra- 
diction by Theorem 1.6(1). Similarly, d = c + ¢ leads to contradiction. 
Hence, uc = ud must imply c = d. So then y = (c,d) = 0. 

Case2.b=a + v. Anexactly similar computation yields that c = d, so 
y=0. 

Hence, (Z,+,:) is an integral domain. O 


2.2 Theorem. N embeds in Z under the mapping n> (n+ 1,1), 
nEN, that preserves addition and multiplication. 


Proof. Let a,oEN and (a+1,1)=(6+ 1,1). Then (a+ 1)+1= 
(6+ 1) + 1. But then by Theorem 1.6(ii), a = 6. Thus N embeds in Z. 
Now 


(at b+ 1l)=(at1+b+1,1+ 1) =(a+ 1,1) +(6+ 1,1) 
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and 
(ab + 1,1) =(ab+1+a+b,1+a+6)=(a+1,1X(b+ 1,1). 


This proves the theorem. O 


2.3 Remark. By Theorem 2.2 we may thus identify each element 7 in 
N with its image (n + 1,1) in Z. Henceforth, we write n for its image 
(n + 1,1), and —” for the additive inverse (1,n 1,n +1). Further, (a,b) = 
(a+ 1,1) +(1,5 + 1) =a + (—)). So we identify (a,b) with a — b. 

Also, the copy of N in Z under this embedding is denoted by Z*. The 
elements of Z are called integers, and those of the subset Z* are called 
positive integers. 

: hg also denote (— € Z|n € Z*), called the set of negative integers, 
yZ. 


2.4 Theorem (trichotomy law of integers). //x € Z, then one and only 
one of the following holds: 


(i) x=0. 
(ii) xEZ*. 
(iii) —-xE Zt. 
Equivalently, Z is the disjoint union of its subsets (0), Z*, and Z. 


Proof. Let x € Z, x * 0. Then 
x = (a,b), a¥b, abeEN. 
Then either a = 6 + u forsome u EN, orb = a + v forsomevE N. We 
show that 
a=b+u if xEZ*, 
and 
b=atv iff —xeE Zt. 
Leta = b+ u. Then 
x = (a,b) = (b + u,b) = (u + 1,1) € Z*. 


Conversely, let x € Z*. Then x = (u + 1,1) for some u € N. But since 
x = (a,b), u+1+6=a+ 1. Then by Theorems 1.4 and 1.6,a= 5+ u, 
as desired. Next. let b = a + v. Then 
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—x = —(a,b) = (b,a) = (a + v,a) = (v + 1,1) € Zt. 
Conversely, let —x € Z*. Then —x =(v + 1, vy + 1,1) fa for some v E N. Again, 


because x=(a,b), —x=(b,a); therefore, (b,a)=(v+ 1,1). Hence, 
b+1=a+v+1; that is, b= a+ v. This completes the proof. O 


We have proved that the ring (Z,+,-) contains a subset Z* that is (1) 
closed under +, (ii) closed under -, and (iii) if x € Z, then one and only 
one of the following is true: x = 0, x € Z*, —x € Z*. The set Z* is called 


the set of positive elements of Z. 


Definition. Any ring R with a subset P of elements satisfying (i), (ii), and 
(iti) in the previous paragraph is called an ordered domain. 


Definition. Let a,b € Z. Then a is said to be greater than b ifa — b € Z*. 
We denote this by a> b. 


Definition. Let a,b € Z. Then a is said to be less than b ifb — a € Z*. We 
denote this bya < b. 


2.5 Remark (trichotomy law of integers). By Theorem 2.4, given 
a,b € Z, one and only one of the following statements holds: 


(i) a=b. 
(ii) a>b. 
(iii) a<b. 


2.6 Remark. a > 5 if and only if b < a. 


The notation a = b shall mean that either a = b or a> 5. We givea 
similar meaning for a s 5. 


2.7 Theorem. The order relations in Z satisfy the following laws: 


(i) Transitive law: Ifa< bandb<c, thena<c. 
(ii) Addition to an inequality: Ifa <b, thna+c<bte. 
(iti) Multiplication of an inequality: Ifa < band0 <c, then ac < be. 
(iv) Lawoftrichotomy: For anya and b inZ, one and only one of the 
relations a= b.a> b. ora<b holds. 
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Proof 


(i) a<bandb<c=>b-—ac—bEZt=(b—a)+(c—bEZ 
=>c—aeZ* >a<c. 


(ii) and (iii) are left to the reader. 
(iv) See Remark 2.5. O 


2.8 Remark. For any ordered domain R, we can define in exactly the 
same manner the order relations >, <, 2, and s, and we can prove the 
laws stated in Theorem 2.4 and Remark 2.5 for the domain Z of integers. 


2.9 Theorem. Jn any ordered ring R all squares of nonzero elements 
are positive. 


Proof. Let 0 # a R. By condition (iii) of the definition of ordered ring, 
either a or — a is positive. In the first case a? is positive by the multiplica- 
tion property of positive elements. In the second case — a is positive. But 
a? = (— a)(— a), by the properties of a ring. Thus, a? is again positive. O 


2.10 Corollary. Jn any ordered ring R with unity e, we have e> 0. 


Proof. e= ee; hence,e>0. O 


2.11. Theorem. (Z,+,-) is an ordered domain with Z* as positive ele- 
ments. Moreover, Z* is well ordered. 


Proof. Conditions (1) and (ii) in the definition of ordered ring for the set Z * 
of positive elements follow from the fact that Z* is the copy of N in Z 
(Theorem 2.2), and these conditions hold in N. Condition (iii) follows 
from Theorem 2.4. Further, Z* is well ordered by Theorem 1.11. © 


We are now ready to prove that the set of integers Z can also be 
characterized as an ordered domain whose positive elements form a well- 
ordered set. 


2.12. Theorem. Any ordered domain D with unity whose positive ele- 
ments are well ordered is isomorphic to the ring of integers Z. 
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Proof. Let e be the unity of D. Define a mapping f: Z —D by /(n) = ne, 
n & Z. Then fis a ring homomorphisn, for 


f(m + n) =(m + nye = me + ne = f(m) + f(n), 
f(mn) = (mn)e = (meXne) = f(m)f(n). 


fis also 1-1, for let n * 0 and ne = 0. Choose m € Z* such that m = nif 
n>0,m=-—nifn<0. Then me=0, m € Z*. Because, in an ordered 
ring, the sum of positive elements is positive, me = e+ e+ -:: + eisalso 
positive. Hence, ne = 0 with n # 0 leads to a contradiction. Therefore, f 
is 1-1. 

We now show Im f= D. If not, let S = D — Im f. We claim S contains 
positive elements. Let x € S. Then — x also belongs to S; otherwise if 
—x€Im f then x € Im f, a contradiction. 

Let m be the minimal element in the set of positive elements of S. 

Case 1. m= e. This is not possible because e € Im f. 

Case 2. m—e>0. We have m—e¢ Im / because m¢ Im /f. But 
m—e<m,acontradiction to the minimality of m. 

Case 3.m— e <0. Then m2 <m < e.Ifm? € Im f, then 3k € Z* such 
that /(k) = m2. But then ke = m? < e, acontradiction. Thus, m? ¢ Im /; 
hence, m? € D — Im f. Again, m2 < m yields a contradiction by the min- 
imality of m. 

Hence, S = ©; therefore, fis an onto mapping. This proves Z = D. O 


The theorem proved above shows that upto isomorphism Z is a unique 
ordered domain whose positive elements are well-ordered. 

We have now completed our program to establish the properties of the 
natural numbers and to obtain the ring of integers by starting from 
Peano’s axioms. 


CHAPTER 14 


Modules and vector spaces 


| Definition and examples 


Let M be an additive abelian group, and let End(A/) be the ring of endo- 
morphisms of M (Section 3.1(c), Chapter 9). Ifr € End(A/), m € M, then 
rm will denote the image of m by r. Therefore 
(i) r(m, +m,)=rm, +rm,, 

(ii) (7, +r2)m=rym+r.m, 

(iit) (ryr2)m=r,(r2m), 

(iv) Im=m, 
where 7,r,,7.2 & R, m,m,,m,& M. 

We then say M isa left module over the ring R = End(M) according to 
the following definition. 


Definition. Let R be a ring, M an additive abelian group, and(r,m) — rm, 
a mapping of R X M into M such that 
(i) r(m, +m,)=rm,+rm,, 

(ii) (ry +r2)m=r,;m+r.m, 

(iii) (ryr,)m=r,(r2m), 

(iv) Im=m if 1eéR, 
for allr,r,,r,€ Rand m,m,,m,€ M. Then M is called a \eft R-module, 
often written as pM. 


If R is a division ring, then a left R-module is called a left vector space 
over R. 
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Often rm is called the scalar multiplication or just multiplication of m 
by ron the left. We define right R-modules similarly. If R is a commuta- 
tive ring and M is a left R-module, then M can be made into a right 
R-module by defining mr = rm. This definition of mr makes M a right 
R-module (verify!). In this case, abstractly speaking, a left R-module is the 
same as a right R-module. Thus, when R is commutative, we do not 
distinguish between left and right R-modules and simply call them 
R-modules. 

We list some elementary properties of an R-module M: 


(i) Om=0, meM, 
(ii) a0=0, aeR, 
(iii) (—a)m= —(am)=a(—m), aeR, meM, 


where 0 on the right sides of (i) and (ii) is the zero of M, and 0 on the left 
side of (i) is the zero of R. 

To prove (i), consider am =(a+0)m=am+0m. To prove (ii), 
consider am = a(m+0)=am + a0. To prove (ii), consider 0 = Om = 
(a +(—a))m = am + (—)m, and also consider 0 = a0 = a(m + (— m)) = 
am + a(—™m). 

Throughout, all modules are left modules unless otherwise stated. 


1.1 Examples of modules 


(a) Let A be any additive abelian group. Then A 1s a left (also right) 
Z-module, because 


k(a, + a.) = ka, + kay, 
(k, - k,)a — k,a + k,a, 
(k,k.)a = k,(k2a), 
la=a 


for all integers k,k, ,k, € Z and for all a,a,,a, € A. 

(b) Let R bea ring. Then R itself can be regarded as a left R-module by 
defining am, m € R, a € R, to be the product of a and m as elements of 
the ring R. 

Then the distributive laws and the associative law for multiplication in 
the ring R show that R is a left R-module. Similarly, R is also a right 
R-module. 

(c) Let M be the set of  X n matrices over a ring R. Then M is a 
module over R, because M is an additive abelian group under the usual 
addition of matrices, and the usual scalar multiplication (ra,) of the 
matrix (a,,) € M by the element r € R satisfies axioms (i)-(iv) for a mod- 
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ule. In particular, the set of | X n (orm X 1) matrices — the set of n-tuples, 
denoted by R” - is a module over R. 
Further, by choosing R = R and n = 2 (or 3), we obtain that the set of 
vectors in a plane (or in space) forms a vector space over the field R. 
(d) (Direct product of modules). Let M and N be R-modules. In the 
cartesian product M X N, define 


(XYAX YW) H(X+EX ty), 
r(x, y) = (rx,ry) 


for all x,x’E M, y,y’EN, and re€R. Then MXN becomes an 
R-module, called the direct product of the R-modules M and N. 


Problems 
1. Show that the polynomial ring R[x] over a ring R is an R-module. 
2. Let Rbearingand let Sdenote the set of all sequences (a,), iE N, 
a,€ R. Define 


(a,) + (b,) =(a,+ 5),  afa,) = (aa,), 


where a,a,,b,€ R. Then S is a left R-module. 
3. Let M be an additive abelian group. Show that there is only one 
way of making it a Z-module. 


2 Submodules and direct sums 


Definition. A nonempty subset N of an R-module M is called an 
R-submodule (or simply submodule) of M if 

(i) a—bENforallabEN. 

(ii) raE NforallaEN,rER. 
Clearly, (0) or simply 0 and M are R-submodules, called trivial submod- 
ules. Jn case R ts a field, N is called a subspace of M. 

We remark that if N is an R-submodule of M, then N is also an 
R-module in its own right. 


2.1 Examples of submodules 


(a) Each left ideal of a ring R is an R-submodule of the left R-module R, 
and conversely. This follows from the definition of a left ideal. 

(b) The subset W = {(a,0,0)ia € F)} of F? is a subspace of the vector 
space F 3, 
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(c) If M is an R-module and x € M, then the set 
Rx = (rx|r€ R) 
is an R-submodule of M, for 


rjX— xX =(r, — r)x € Rx, 
ri(r.X) = (r,r2)x € Rx, for all r,,7, € R. 


(d) If AZ is an R-module and x € M, then the set 
K = (rx + nx|rE€ R, ne Z) 
is an R-submodule of M containing x. Further, if R has unity, then 
K = Rx. First, (K,+) is clearly an abelian subgroup of (4/,+). Next, let 
ae R, rx + nx € K. Then 


a(rx + nx) = a(rx) + a(nx) 
= (ar)x +a(xt+ °°: +x) or arx+a((—x)+ °-°> +(—x)), 


according to whether 7 is a positive or a negative integer. But then 
a(rx + nx) =((ar) tat °*+ t+alx or ((ar)+(—a)+ °*+ +(—a)), 
since a(— x) = (— a)x. Therefore, 

a(rx + nx) = ux for some uE R. 


Hence, a(rx + nx) € K, for all a,rin R and for all n (including 0) in Z. 
Choosing r= 0 € R and n= 1 € Z in rx + nx gives x € K. It is worth 
noting that if ZL is any other R-submodule of / containing x, then L 
contains all elements of the form rx + nx, re R, n€& Z; hence, K C L. 
Thus, K is the smallest R-submodule of MM containing x, usually denoted 
(x). 

Suppose R has unity e. Then we show that K = Rx. Let rx + nx € K. 
For n> 0, 

rx + nx = rx + n(ex) = rx + (ext ++: + ex) 
=(rtet:: +ex=ux 


for some u € R. Thus, rx + nx € Rx. Similarly, if ns 0, then rx + nx € 
Rx. So K C Rx. Trivially, Rx C K. 


2.2 Theorem. Let (N,)je, be a family of R-submodules of an 
R-module M. Then ),-,N, is also an R-submodule. 


Proof. Let x,y € Nie, N,, @ © R. Then for all iE A, x — y € N, and ax € 
N,, because N, are R-submodules. Thus, x — y, ax @ NyeaN,, which 
proves that ,-,N, isan R-submodule. O 
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Let S be a subset of an R-module M. Let f = (N|Nis an R-submodule 
of M containing S). Then.# ¥ @ because M € of. Let K = Ne. Then 
K is the smallest R-submodule of M containing S and is denoted by (S). 
The smallest R-submodule of M containing a subset S is called the 
R-submodule generated by S. If S = (x,,...,X,,) is a finite set, then (S) is 
also written as (X, ,...)Xj») 


Definition. An R-module M is called a finitely generated module if M = 
(x, ,..-X,) for some x, M, 1 s isk. The elements x,,...,X, are said to 


generate M. 


Definition. An R-module M is called a cyclic module if M = (x) for some 
xEM. 


Example 2.1(d) shows that a cyclic module generated by x is precisely 
(rx + nx|r € R,n € Z), and if R has unity then it simplifies to (rx|r € R), 
that is, to Rx. 


2.3 Theorem. /fan R-module M is generated by a Set {X, ,Xq5...5Xn) 
and\ €& R, then M = (r,x, + 2X. + °*+ +17,X,1r, € R). The right side is 
symbolically written i", Rx,. 


Proof. Clearly, if m,m,,m,€ 27.,Rx, and rER, then m,— 
m, € 2, Rx, and rme& =7.,Rx,. Thus, 27, Rx, is an R-submodule 
of M. Also, 1x, = x, Rx,C 2%.,Rx,. Thus, all x, € 27.,Rx,. But since 
M is the smallest submodule of 4/4 containing all x,, the submodule 
2y.., Rx, must be equal to M. O 


If an element mEM can be expressed as m=a,xX,+ °° + 
a,X,,4,& Rand x,€ M, i= 1,...,n, then we say that m is a linear combi- 
nation of the elements x,,...,x, over R. 

We remark that the set of generators of a module need not be unique. 
For example, let S be the set of all polynomials in x over a field F of degree 
<n. Then S is a vector space over F with (1,x,x*,x’,....x"} and 
(1,1 + x,2,23,...,x") as two distinct sets of generators. 


Definition. Let (N,), 1 s isk, be a family of R-submodules of a module 
M. Then the submodule generated by UL.,N,, that is, the smallest sub- 
module containing the submodules N,, 1s isk, is called the sum of 
submodules N,, | s is k, and is denoted by Z*.,N,. 
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2.4 Theorem. Jf (N,), | s isk, is a family of R-submodules of a 
module M, then 


k 
J N, = {xy 2 einai x,|[x,EN;,}. 
f=1 


Proof. Let S= (x, + °*: + x,/x,E N,). If x, + <9 + x, and y, + 

--+ + y, belong to 5S, then 

(Xp For FXE)V— OM He FV) =O TIM) He + eI) 

also belong to S, because x, —y,€ N,, 1 sissk. Also, ifr € R, then 

r(x, + os HX) BX to + 7X, ES, 

because each rx,, 1 s isk, isin N,. Thus, S is a left R-submodule. 
Further, if Kis any left R-submodule that contains each submodule N,, 

then K contains all elements of the form x, + --- +x,,x,€ N,. Thus, K 


contains S. Hence, S is the smallest submodule containing each N,, 1 s 
i = k. Therefore, by definition of 2, N,, 


k 
S=J)N,. a 
int 


Remark. The sum ,<, N, of a family (N;,) je, Of R-submodules of a 
module M is defined similarly as the submodule generated by U,., N;. 
Following exactly the proof of Theorem 2.4, one obtains 


N= S x1x,€ mI, 
finite 


iEA 
where Laie X; Stands for any finite sum of elements of R-submodules N,, 
iE A. 


Definition. The sum 2, .N, of a family (N;),¢, Of R-submodules of an 
R-module M is called a direct sum if each element x of 2, N; can be 
uniquely written as x = 2,x,, where x,€ N, and x,=0 for almost all i 
in A. 


When the sum 2,<,N, is direct, we write it as © £,<, N,. Should it 
happen that A is a finite set {1,...,k}, then the direct sum @ Zien N,is also 


written as VN, © --> ON,. . 
Each N, in the direct sum © 3,<, N, is called a direct summand of the 


direct sum. 
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2.5 
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Theorem. Let (N,)<, be a family of R-submodules of an 


R-module M. Then the following are equivalent: 
(i) Zien N, is a direct sum. 
(ii) O=Z,x,€ Leg N, implies x, = 0 for all i. 
(iti) Ni Zyenjai Ny = (0), EA. 


Proof. Exactly similar to the proof of Theorem 3.2 in Chapter 10. O 


Problems 


Let V = R? be a vector space of 3-tuples over the real field R. 
Determine if W is a subspace of V, where W is the set of all 
(A; ,Az,A;) such that 


(a) A, =0. (d) A, +1 =24A,. 
(b) A, =A. (e) A, +A, 20. 
(c) A, A, = 0. 


Show that the set ofall functions /from the real field R to R can be 
made into a vector space by the usual operations of sum and 
scalar product. 

Let V be the vector space of Problem 2. In each of the following 
determine if W is a subspace, where W is the set of all functions 
f:R-R satisfying 


(a) f(1)=0. (d) f(3) 20. 

(b) f(0)=1. (e) lim ,.., f(t) exists. 

(c) f(3) = 2f(2). 

Modular law. Let A, B, and C be R-submodules of an R-module 
M such that A > B. Show that 


AN(B+C)=Bt (ANC). 


Give an example of three R-submodules A,B,C of an R-module 
M such that 


AN(B+C)#(AN B)+(ANC). 


In other words the set of all R-submodules of an R-module 
M is a modular lattice which is not necessarily distributive. 
Let M be an R-module. Show that the set (x € R[x = 0) is an 
ideal of R. (M is called faithful if this ideal is zero.) 

Let M be an R-module and RM = {%, 7, m,|r,;,& R, m,€& M). 
Show that RM is a submodule of M. (Here = denotes finite sum.) 


I. 


13. 


3 
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Let V = R? be a vector space over a field R. Let 

x, =(1,0,0), x; =(1,1,0), x; = (1,1,1). 

Show that 

V = Rx, + Rx, + Rx;. 

Also show that 


3 
V=® ¥ Rx,. 
i=! 

Let K @ K’ and L © L’ be direct sums of submodules of M such 

that K@ K’ = L©® L’. Show that K = L does not necessarily 

imply K’ = L’. 

Let R be the ring of all 2 X 2 upper triangular matrices over the 

field Z/(2). 

(a) Lust all direct summands of R as a left R-module; that is, all 
left ideals A of the ring R such that 4 © B= R for some left 
ideal B of R. 

(b) Foreach direct summand in (a) list all idempotents generat- 
ing it as a left R-module. 

Let e,,...,e, be pairwise orthogonal idempotents in a ring R with 

unity. Prove that 

(a) ®2, Re, is a direct sum of left ideals. 

(b) e=e, + °*+ +e, is an idempotent of R. 

(c) (@,,...,e,, 1 — e} is a maximal set of pairwise orthogonal 
idempotents of R. 

For any two idempotents e, fin R, prove that 


Re+ Rf= Re® R(f— fe). 


Let I be a left ideal in a ring R with unity. Show that /? = Jif isa 
direct summand of R as a left R-module. 

Let R be acommutative ring, and let / bea finitely generated ideal 
in R with J = J, Show that / is a direct summand of R. Give an 
example to show that if / is not finitely generated, then / need not 
be a direct summand. 


R-homomorphisms and quotient modules 


Definition. Let f be a mapping of an R-module M to an R-module N such 


that 


(i) Sixty) =f(x) + f(y) 
(it) f(rx) = rf(x) 
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forallx,y€ Mandr€ R. Thenfis called an R-linear mapping (or simply 
a linear mapping) or an R-homomorphism of M into N. Hom,(M,N) 
denotes the set of R-homomorphisms of M into N. If M=N, then f is 
called an endomorphism of M, and then the set Hom,z(M,M) is also 
denoted by Endp(M). 

IfR is a field or a division ring, then fis also called a linear transforma- 
tion of the vector space M to the vector space N. 


Let f: M— N be an R-homomorphism of an R-module M into an 
R-module N. Then /, in particular, is a group homomorphism. Thus 


(a) f(0)=0. 
(b) f(—x) =—f(x), x EM. 
(c) f(x— y)=f(x) —S(y), xy EM. 


Definition. Let f M — N bean R-homomorphism of an R-module M into 
an R-module N. Then 


(a) ThesetK =(x € M{|f(x) = 0) is called the kernel of fand written 
Ker f | 

(b) The set {(M) = (f(x)|x € M} is called the homomorphic image 
(or simply image) of M under f and is denoted by Im f. 


It can be easily proved that Ker fis an R-submodule of M, and that Im f 
isan R-submodule of (Problem 1). Further, Ker f= (0) ifand only iff is 
1-1. If fis 1-1, we say that M is isomorphic (or R-isomorphic) into N, or M 
is embeddable in N, or there is acopy of Min N, and we write itas M C, N. 

If f is both 1-1 and onto, then we say that M is isomorphic (or 
R-isomorphic) onto N, and we write it as 4 = N. It can be easily shown 
that = is an equivalence relation in the set of R-modules. Clearly, 4 = M 
by the identity mapping. Let M = N under a mapping f. We show that the 
inverse mapping f—': N— M is an R-homomoprphism. Let y,y,,y. € N 
and a& R. Then there exist unique x,x,,x,€M such that /(x)= 
y S(x) = y), i= 1,2. Now, 


Sf" (My 2) @ SIH Se) + S02) BS (FO, + X2)) 
=X, + x,™f—'(y,) +S! (2). 


Further, 
f-' (ay) =f~' (af(X) =f! (f(ax)) = ax = af~' (y). 


Therefore, N = M under f—'. Finally, if 44 = N under a mapping /, and 
N = K under a mapping g, then / = K under the composite mapping gf. 
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3.1 Examples of R-homomorphisms 


(a) Let M be an R-module. Then the mapping i: x — x of M onto M is 
clearly an R-homomorphism of M onto M. It is called the identity endo- 
morphism of M. Similarly, the mapping 0: x > 0 of M into M is also an 
R-homomorphism of M into M; this is called the zero endomorphism of 
M. 

(b) Let M be an R-module over a commutative ring R, and let a be 
some fixed element of R. Then the mapping 


f:x-ax, xEM, 


of M into M is an R-homomorphism of M into M. 
(c) Let f: R? — R be the mapping defined by 


S(Xy,.05Xn) =X, (i fixed). 


Then f isan R-homomorphism of the R-module R’ onto the R-module R; 
this is called the projection of R" onto the ith component. 

(d) Let V be the vector space of real-valued functions of a real variable 
that have derivatives of all orders. Let D be the derivative operator. Then 
the mapping 


foDf), fev, 


of V into V is an R-homomorphism of V. 

(e) Let A be any m X n matrix over F. Consider the mapping 7: v—> 
vA of the vector space F™ to the vector space F”, where the elements of F” 
and F” are written as | x mand 1 x n matrices, respectively. Then T is 
an F-homomoprphism (i.e., linear transformation) since 


T(v, + v2) = (v, + V2 )A ™= U, At V2 A T(v,) + T(v2), 
T(av) = (av)A = a(vA) = aT(v), 


for all v,v, ,v, € F”, and aE F. 

Theorems analogous to the fundamental theorems on homomor- 
phisms and isomorphisms of groups and rings also hold for modules. But 
before we can obtain those theorems we need to define the concept of 
quotient modules (or factor modules). 

Let N be an R-submodule of an R-module M. Let a, ,a, € M. We say 
a, = a, (mod N) ifa, —a, € N. It follows immediately that # is an equiv- 
alence relation. Let a denote the equivalence class containing a € M. 
Clearly, ais the set of elements a + x with x € N, and thus ais also written 
as a + N. We denote the set of equivalence classes by 44/N (some authors 
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denote it by / — N). In M/N we define the binary operations as follows: 
a+b=at+b, a,bEMIN, 
ra = ra, aE M/N,reER. 

Following the construction of quotient groups, one can easily verify 
that these binary operations are well defined, and they make the set 4/N 
an R-module. This module is called a quotient module(or a factor module) 
of M modulo N. 

If R is a field, then M//N is called the quotient space of M modulo N. 

It is quite natural to study the structure of the quotient module 4/N: 
for example, how the submodules of M/N are related to the submodules of 
M. 


3.2 Theorem. The submodules of the quotient module M/N are of the 
form U/N, where U is a submodule of M containing N. 


Proof. Let f: M — M/N be the canonical mapping; that is, f(x) = x + N, 
x € M. Let X be an R-submodule of M/N. Consider U = (x € M|/(x) € 
X). We claim that U is an R-submodule of M. For if x,y Uand re R, 
then f(x — y) = f(x) — f(y) € Xand f(rx) = rf(x) € X, which shows that 
U is an R-submodule of M. Also, NC U, because for all x € N, f(x) = 
0€ X. Thus, N is an R-submodule of U. Also, if x € X, then there exists 
y € Msuch that f(y) = x, because fis an onto mapping. So by definition 
of U, ye U. Hence, X c f(U). Clearly, f((U) c X. Thus, X = f(U). But 
f(U) = U/N. Thus, X= U/N. O 


The proof of the following theorem on R-homomorphisms is exactly 
similar to that of the corresponding theorem for groups or rings. 


3.3 Theorem (fundamental theorem of R-homomorphisms). Let f 
be an R-homomorphism of an R-module M into an R-module N. Then 
M/Ker f= f(M). 


Proof. Consider the mapping g: M/Kerf—>/(M) given by g(m+ 
Ker f) = f(m). Along exactly the same lines as the proof of the funda- 
mental theorems of homomorphisms for groups or rings, we can show 
that g is an R-isomorphism of M/Ker fonto f(M). O 
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3.4 Theorem. Let A and B be R-submodules of R-modules M and N, 
respectively. Then 


AXB A B 


Proof. Define a mapping 


M_N 
>MXN-?—X— 
IMXN a*3 
by f(m,n) = (m+ A,n + B), mE M,nE N. It isa straightforward venfi- 
cation that fis an onto R-homomorphism. Now 


Ker f= ((m,n)|\(m + A,n + B) = (0 + A,0 + B)) 
== ((m,n)|m E A,n € B) 
=AXB. 


Therefore, by the fundamental theorem of R-homomorphisms 


MXN M_N 
AXB AB ® 


3.5 Examples 


(a) Let R be a ring with unity. An R-module & is cyclic if and only if 
M = R/I for some left ideal / of R. 


Solution. Let M = Rx be a cyclic module generated by x. Let J/= 
(re Rirx = 0). Then /isa left ideal of R. Define a mapping/: R — Rx by 
S(r) = rx, rE R. It is obvious that fis an R-homomorphism that is also 
onto. Also, Ker f= (re Rirx = 0) = J. Hence, by the fundamental 
theorem of R-homomorphisms, R// = Rx. For the converse, we note that 
the left R-module R// is generated by | + J € R//; thatis, RL + 7) = R/I. 
Hence, R// is cyclic. 


(b) Let M be an R-module. Then Hom,(M,™M) is a subring of 
Hom(M,™). 


Solution. Recall that Hom(M, MM), the set of endomorphisms of M, re- 
garded as an abelian group, is a ring (Example 3.1(c), Chapter 9). Clearly, 
Hom,(M,M)C Hom(M,M). Let fg © Hom,g(M,M), x € M, and re 
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R. Then 

(f — gXrx) = (rx) — B(rx) = f(x) — rg(x) 

= r(f(x) — g(x) = (Uf — 8X). 

Further, 

(feXrx) = f(g(rx)) = f(re(x)) = r(f(g(x))) = r(CfeX)). 
Therefore, f— g, fg € Hom,(M,M). Hence, Hom,(M, M)isasubring of 
Hom(M, ™). 


(c) Let R be a ring with unity. Let Hom,(R,R) denote the ring of 
endomorphisms of R regarded as a right R-module. Then R= 
Hom,(R,R) as rings. 

Solution. Consider the mapping f: R — Hom,(R,8), given by /(a) = a*, 
where a*(x) = ax, x E R. Let x,y,r © R. Then 

a*(x + y) = a(x + y) = ax + ay=a*(x) + a*(y), 
and 

a*(xr) = a(xr) = (ax)r = (a*(x))r. 
Thus, a* isan R-homomorphism of the right R-module R into itself; that 
is, a* € Home(R,R). 

We now show that fis a ring homomorphism. Let a,b € R. Then for 
any xE R, 

(a + b)*(x) = (a + b)x = ax + bx 

= q*(x) + b*(x) = (a* + b*\x). 

Thus, (a + b)* = a* + b*. Similarly, 

(ab)*(x) = (abXx) = a(bx) = a(b*(x)) 

= a*(b*((x))) = (a*b* Xx), 

so (ab)* = a*b*. Hence, 

f(a + b) = (a+ b)* = a*+ b* = f(a) + f(b) 
and 

S(ab) = (ab)* = a*b* = f(a) f(b). 


Finally, we show that fis both 1-1 and onto. Let a,b € R and a* = b*. 
Then a*(x) = b*(x) for all xE R. This gives ax = bx for all xE R, 
so (a- b)R=0. Then a=b. Thus, f is 1-1. Now suppose ¢€ 
Home(R,R). Let ¢ (1) = a. We claim ¢ = a*; that is, a is a preimage of ¢. 
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Now for any x € R, 

t(x) = ¢(1x) = (1)x = ax = a*(x). 
Hence, ¢ = a* as claimed. This proves that fis also an onto mapping. 


(d) Let R be a ring with unity. Let Hom ,(R,R) denote the ring 
of endomorphisms of R regarded as a left R-module. Then R® = 
Hom,(R,R) as rings. 


Solution. As in (c) we consider the mapping 
Sf: RP = Home (R,R), 


given by f(a) = a*, where a*(x) = ae x = xa. Then a* is an R-homo- 
morphism of the left R-module R into itself, and the mapping /is a ring 
isomorphism. The proof is exactly similar to the one for a right R-module 
R given in Example (c). 


We call a sequence (finite or infinite) of R-modules and R-homomor- 
phisms 


an Sats Y_, 44 y +My, Sa es cals 


exact ifIm f, = Ker f,,, for all 7. 
(e) Suppose that the following diagram of R-modules and R-homo- 


morphisms 
L 
i 
I 


is Commutative and has exact rows. Show that 


(i) Ifa,y, and f’ are 1-1, then so is £. 
(1) Ifa,y, and g are onto, then so is £. 


——eeeee f/f eee 


K 
K f 


a al 


f 8 
B 


————— = 


8 
F & 
eS 


Solution. Let m€ Ker £. Because the diagram commutes, yg(m) = 
g’B(m) = 0, so g(m) = 0 because y is 1-1. Therefore, m € Ker g = Im f 
because the top row is exact. This implies m = f(k) forsomek € K. Again, 
because the diagram commutes, f‘a=£f Thus, f‘a(k) = Bf(k) = 
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B(m) = 0. This implies k = 0, since both @ and /’ are 1-1. Then m= 
S(k) = (0) = 0, proving (i). The solution of (ii) is similar. 


Problems 


4 


If f. M— N is an R-homomorphism of an R-module M to an 
R-module N, show that 

(a) Ker fis an R-submodule of M. 

(b) Im fis an R-submodule of N. 

If A and B are R-submodules of an R-module M, show that 
(A + B)/A = B/AN Bas R-modules. 

Prove that Homz(Q,Q) = Q as rings. 

Let M be an R-submodule and x € M be such that rx = 0,r € R, 
implies r = 0. Then show that Rx = R as R-modules. 

Let V be a vector space over a field F generated by x, ,X2,...,X, 
and suppose that any relation a,x, + a,x, + °°: +a,x,=0, 
a, F, implies a, =0 =a, = ++: = a,. Show that V= F". 

Let M= K©@ K’=L© L’ be direct sums of submodules of / 
such that K = L. Show that K’ = L’. 

Suppose J is a left ideal of a ring R with unity, and R/J=R 
regarded as R-modules. Prove that J = Re for some idempotent 
eE R. 

Let N,,...,V, be a family of R-submodules of an R-module M. 
Suppose 

N,+(N,N *! NNN Nia Nt ON) = M 


for all i = 1,...,k. Show that 
k 
ou /Qn.= Min, Xo XMIN,. 


(Note that this is an analogue of the Chinese remainder theorem.) 


Completely reducible modules 


Definition. An R-module M is called simple or irreducible if RM # (0) and 
(0), M are the only R-submodules of M. (Here RM is the set of finite sums 
Zr,m,,7r,E R, m,E M. If1ER, RM=0 only ifM=0.) 


A trivial example of a simple module is a field or a division ring R 


regarded as a module over itself. 
Let R = F, be the matrix ring over a field F. Then M = Re,, isa simple 
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R-module, where e,, is a matrix unit; for, let NV * 0 be an R-submodule of 
Re, and let O * 27_,a,e, € N. Then forsome /, 1 s/s 2, ay, # 0;so az 
exists. This gives 


Rn 
ey = (arteu)( Saneu EN, 
al | 

Hence, N= Re,,, proving that M is simple. Note that is the set of 
matrices all of whose entries are zero, except possibly in the kth column. 

A curious reader might ask if a minimal left ideal in a ring R (defined in 
Section 3, Chapter 10) isa simple R-module. The answer, in general, is no. 
To see this, let A be an additive abelian group of order p, p prime. Make A 
into a ring by trivial multiplication. Then A is a minimal left ideal, but A is 
not a simple A-module because A? = 0. However, for rings with unity a 
minimal left ideal in a ring R is obviously a simple R-module. 

We now prove the following characterization of simple modules. 


4.1 Theorem. Let R be a ring with unity, and let M be an R-module. 
Then the following statements are equivalent: 


() Mis simple. 
(ii) M #(0), and M is generated by any0 #x€ M. 
(iti) M=~R/I, where I is a maximal left ideal of R. 


Proof. (i) => (11) LetO0 # x € M. Then (x) = Rxisa nonzero R-submodule 
generated by x. So M = (x). 

(i1) => (i) Let 0 # N be an R-submodule of M. Let 0 # x € N. Then by 
(11) M = (x) C N. Hence, N = M, proving that M is simple. 

We may note that, for rings without unity, (ii), in general, does not 
imply (i) (consider the earlier example of a ring A in which A as an 
A-module satisfies (11), but A is not simple). 

(i) => (iii) Because RM = M #0, there exists 0 # x E M such that 
Rx #0. But Rx ts clearly an R-submodule of M. Because M is simple, 
Rx = M. Define a mapping f: R — Rx by f(a) = ax. fis an R-homomor- 
phism of R regarded as an R-module to M. fis clearly surjective. Let 
I= Ker f. Then by the fundamental theorem of R-homomorphisms 
R/I = Rx. Because R// is simple, it follows by Theorem 3.2 that the only 
left ideal of R that contains / properly is R alone. Thus, / is a maximal left 


ideal. 
(ii1) => (i) Follows from Theorem 3.2. O 


The most important fact about simple modules is the following basic 
result. called Schur’s lemma. 
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4.2 Theorem (Schur’s lemma). Let M be a simple R-module. Then 
Hom,(M,M) is a division ring. 


Proof. Let 0 * @ € Hom,(M,M). Consider the R-submodules Ker ¢ and 
Im ¢ of M. If Ker @ = M, then d = 0, acontradiction. Thus, Ker @ = (0), 
so @ is injective. Further, if Im @ = (0), then @ = 0, a contradiction. 
Therefore, Im @ = M, which implies @ is surjective. Hence, @ is bijec- 
tive, which proves that @ is invertible. O 


Definition. An R-module M is called completely reducible if M= 
Zae,M,, where M,, are simple R-submodules. 


4.3 Theorem. Let M = ~,.,M, be a sum of simple R-submodules 
M,,. Let K be a submodule of M. Then there exists a subset A‘ of A such 
that 246M, is a direct sum, and M = K © (OX ye,-Mo). 


Proof. LetS = (AC A|Z,e,M, isa direct sum, and KN 2,.,M, = (0)). 
S * @ since O € S. (Note, by Z,.,M, we mean (0) when A is an empty 
set.) S is partially ordered by inclusion, and every chain (4,) in S has an 
upper bound U4 in S. Thus, by Zorn’s lemma, S has a maximal member, 
say A. LetN = K © (@3,.,M,,). Weclaim N = M.Solet 8 € A. Because 
M, is simple, either M, N = (0) or Mz, N= M,. Now M,N N = (0) 
implies My ©2ge,Ma = (0); 80 Zacauig) Ma is a direct sum, and it has 
zero intersection with K. But this contradicts the maximality of A. There- 
fore, for all BE A, M,C N, proving thatN=M. O 


By choosing K = (0) in Theorem 4.3, we obtain 


4.4 Corollary. If M= X,6,M, is the sum of a family of simple 
R-modules (M, )aea, then there exists a subfamily (M, )aenrca Such that 
M=@,6nMyg. 


As a consequence of the above results, we can show that every nonzero 
submodule and every nonzero quotient module of a completely reducible 
module is completely reducible (Problems | and 2). 


Problems 


1. Let Mbeacompletely reducible module, and let K be a nonzero 
submodule of M. Show that K is completely reducible. Also show 
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that K is a direct summand of M. [Hint: M= @2,.,M, and 
4JA’c A such that M = K @(@z,,,,.M,). Then 


M © SacnnMa OXaenM, 
K= ae acA a aca M ; 
© Zacn'Ma OLcnMa . ° 


2. Let M be a completely reducible module, and let K # M be a 
submodule of M. Show that M/K is completely reducible. 

3. Show that Z/(p, p,) is a completely reducible Z-module, where 
p, and p, are distinct primes. 

4. LetR = F,bethe 2 X 2 matrix ring over a field F. Show that Ras 
an R-module is completely reducible. Prove the same if R = F,, 
for any positive integer 7. Show also that if A is any left ideal in R, 
then A = Re for some idempotent é in R. 

5. Let R=(§f] and M= [84]. Show that R/M is a completely 
reducible R- (as well as R/M-) module. 

6. Let A and B be rings such that , A and ,B are completely reduc- 
ible modules. Let R = A © B be the ring direct sum of A and B. 
Show that ,R is completely reducible. 

7. Let R be a ring with unity. Show that R as an R-module is 
completely reducible if and only if each R-module M is com- 
pletely reducible. [Hint: M = 2,-,,Rx, and Rx is a homomor- 
phic image of R as a left R-module.] 


5 Free modules 


Throughout this section, unless otherwise stated, R is a nonzero ring with 
unity. 


Definition. A list - that is, a finite sequence - x,,...,.x, of elements of an 
R-module M is called linearly independent if, for any a,,....a,€ R, 
27.1 a,X,;= 0 implies a, =a,= +: =a, =0. 

A finite sequence is called linearly dependent if it is not linearly inde- 
pendent. 


A subset S of an R-module M is called linearly independent if every 
finite sequence of distinct elements of S is linearly independent. Other- 
wise S is called linearly dependent. 

An example of a linearly independent set in the F-module F[x] is 
(1,x,x?,x3,...). The set (1,x,1 + x,x?) is clearly a linearly dependent set. 

In a module M= R" over a ring R with unity, consider the set 
(€,,€2;.-.,€,}, where e, is the 7-tuple in which all components except the 
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ith are zero and the ‘th component is 1. This set (e, ,e;,...,e, } iS a linearly 
independent set in R" and is indeed a basis according to the following 
definition. 


Definition. A subset B of an R-module M is called a basis if 


(i) Mis generated by B. 
(ii) Bisa linearly independent set. 
The basis (e,,€3,....€,) of R" that we have described is called the stan- 
dard basis of R". 


Of course, not every module has a basis. For example, consider a cyclic 
group G and regard it as a Z-module. Then G has a basis if and only if G is 
infinite. For let G = (a) and ma, m € Z, be some basis element. Then 
A(ma) = 0, AE Z, must imply 4 = 0. However, if G is a finite group of 
order n, then n(ma) = 0, a contradiction. Hence, G must be infinite if it 
has a basis. But if G = (a) is an infinite cyclic group, then (a} is clearly a 
basis of G as a Z-module. This proves our assertion. 


Definition. An R-module M is called a free module if M admits a basis. In 
other words, M is free if there exists a subset S of M such that M is 
generated by S, and S is a linearly independent set. 


We regard (0) as a free module whose basis is the empty set. 


5.1 Theorem. Let M be a free R-module with a basis {e,,...,e,}. Then 
M =~ R". 


Proof. Define a mapping ¢@: M — R" by 
(3 res) Sarass > riSis 
im} im} 
where 
f, = (0,...,1,0,...,0) € R". 


Because 27_,7,e, = 27.,r;e, implies, by the linear independence of the e,’s, 
r,=r‘ for all i, @ is well defined. If m= T7_,r,e,, m’ = TL rie, and 
rE R, then it immediately follows that 

gm + m’) = d(m) + d(m’) sand —s (rm) = rd(m). 
Further, if d(m) = 0, then 27_,7,/, = 0. This implies (r,,...,7,) = 0, and 
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hence each r, = 0, proving that @ is 1-1. It is clear that ¢ is onto, and, 
hence, ¢ is an isomorphism. O 


§.2 Theorem. Let V be a nonzero finitely generated vector space over a 
field F. Then V admits a finite basis. 


Proof. Let (x, ,X2,...,X,) be a subset of V that generates V. If this set is not 
linearly independent, then there exist a, ,@2,...,.@, in F, not all zero, such 
that a,x, + a,x, + °° +a,x, =0. For simplicity we may assume 
a,*0. Then x, =f.x,+ ++ +8,x,, where 8B, = —a™'ay,...,8, = 
— ay 'a,. This shows that the set (x2,...,.x,} also generates V. If (x,...,.X,) iS 
a linearly independent set, we are done, otherwise we proceed as before 
and omit one more element. 

Continuing like this, we arrive at a linearly independent set that also 
generates V. Note that this process must end before we exhaust all ele- 
ments; in the extreme case if we had come to a single element that gener- 
ates V, then that single element will form a basis because each nonzero 
element in V forms a linearly independent set. Hence, V does admit a 


finite basis. O 


Remark. Because we regard the empty set as the basis of a zero vector 
space, Theorem 5.2 shows that each finitely generated vector space has a 
basis with finite cardinality. Indeed, it can be shown that every vector 
space has a basis, finite or infinite — a consequence of Zorn’s lemma. 


§.3 Theorem. Let M be a finitely generated free module over a com- 
mutative ring R. Then all bases of M are finite. 


Proof. Let (e,), i€ A, be a basis of M, and let (x,,x,...,x,} be a set of 
generators of M. Then each x, can be written as 
x= Saye, ayER, 
é 


and all but finite number of a@,, are zero. Thus, the set S of those e,'s that 
occur in expressions of all the x,’s, j = 1,2,...,”, iS finite. O 


5.4 Theorem. Let M be a finitely generated free module over a com- 
mutative ring R. Then all bases of M have the same number of elements. 


Proof. Equivalently, the theorem states that if R” = R*, then m =n. Let 
m<n, let d: R"™— R" be an R-isomorphism, and let y= d-'. Let 
(€; ,...5€m) and (/,,..., f,) be ordered bases of R” and R", respectively. Let us 
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write 
PE) = ay fit os tanh Isism, 
WG) = bye, + °° + Open Isfsn. 
Let A = (a;,) and B = (b,,) be n X m and m X n matrices. Then 
ype) = ) > by Aye, Lsism. 
k=1 j=l 


Thus, by the linear independence of the e,’s and by the fact that y = @7!. 
we have 


Dhan us 
This yields 
b, ot) Dp Gy, °° Am 
BA= =], 
en Bee hee, Plies, o> Whe 


where /,, is the 2 X m identity matrix. Similarly, AB = /,. Let 
A’=[A0) and B= H 


be n X n augmented matrices, where each of the 0 blocks is a matrix of 
appropriate size. Then 


‘Ro = Oe a L, 0 
A'B'=1,, — BYA E 4) 


This implies det(A’B’) = | and det(B’A’) = 0. But A’ and B’ arenXn 
matrices over acommutative ring. So det(A’B’) = det(B’A’), which yields 
a contradiction. Hence, m2n. By symmetry n2=m. This proves 
m=n. O 


Definition. The number of elements in any basis of a finitely generated free 
module M over a commutative ring R with unity is called the rank of M, 


written rank M. 
In particular, if R is a field then the rank of M is known as the dimension 


of the vector space M and is denoted by dim M. 


We call a vector space V over a field F finite dimensional if it is finitely 
generated; otherwise we call it an infinite-dimensional vector space. 
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5.5 Examples 


(a) Every finitely generated module is a homomorphic image of a finitely 
generated free module. 


Solution. Let M be an R-module with generators x,,...,.x,. Let e, be the 
n-tuple with all entries 0 except at the ith place, where the entry is |. Then 
{e,,...,€,) are linearly independent over R and generate a free module R’. 


Define a mapping @: R" — M by 


a n 
(3 res) =P 7X). 
i=! i=} 

Because each element x € R"hasa unique representation as L/_,r,e,,@ 


is well defined. 
Further, if x = 2ve,, y= L,rje,, and rE R, then it is clear that 


P(x + y) = A(x) + Ay), —- H(rx) = r P(x). 


Hence, ¢ isan R-homomorphism of R’ onto M. If K is the kernel of @, it 
follows by the fundamental theorem of homomorphisms that R"/K = A/. 


(b) Let V be a vector space over a field F with a basis (e,),-,. Prove the 
following: 


(i) V=OXeqFe, = OXeaF,, Fi = F. 
(ii) Vis completely reducible. 
(i) If Wis a subspace of V, then there exists a subspace W’’ such that 
V= WOR". 
(iv) Suppose]A]< ©, and let (e,,...,e,) be a linearly independent sub- 
set of V’. Show that there exists a basis of V containing (@,,...,e;)}. 
(This is also true without the assumption that |A| < ©.) 


Solution. (i) V = ©%,-,Fe, follows from the fact that the set (e,),<, gener- 
ates Vand is a linearly independent set. Next, it 1s clear that the mapping 
a» ae,, a & F, is a bijective linear mapping from F to Fe, regarded as 
vector spaces over F. Thus, F = Fe,, proving (1). 

(ii) Because ’ = @3,.,F;, F; = F, and F is a simple F-module, it 
implies that V is completely reducible. 

(i11) Because Vis completely reducible, the result follows by Theorem 
4.3. 

(iv) Let W = @S*_, Fe,. By (iii) V = W © W’ for some subspace W’’ of 
V. Let (€,4,---:@,) be a basis of W’, Then (@, ,...,€¢ 0g +9 €n) IS a basis of 
V. 


268 


Modules and vector spaces 


Problems 


l. 
2. 
3. 


6 


Let R be a commutative ring with unity, and let e * 0, | be an 
idempotent. Prove that Re cannot be a free R-module. 

Prove that the direct product M, X ::: X M, of free R-modules 
M, is again free. 

Let (x,)ie, be a basis of a free R-module M. Prove that M = 


ZienRX, ' 


Consider the diagram of R-modules and R-homomorphisms 
F 
| f 


M—e N ——~> 0 
8 


with exact row; that is, gis surjective. Suppose F is free. Show that 
there exists an R-homomorphism h: F —~ M such that gh =. 
Prove that ,Q is not a free module. Can you generalize this to pK, 
where D is a commutative integral domain with unity and K is its 
field of fractions? 

Show that every ideal of Z is free as Z-module. 

Show that every principal left ideal in an integral domain R 
with unity is free as a left R-module. 

Show that every module is a homomorphic image of a free 
module. 

An onto homomorphism g: A — B from an R-module A to an 
R-module B is said to split if Kerg is a direct summand of A. 
Show that if B is free and g: A — B is an onto homomorphism, 
then g splits. 


Representation of linear mappings 


In this section F denotes a field. We show that to each linear mapping @ 
from a vector space V over F to a vector space U over F there corresponds 
a matrix A with respect to bases @ and ¢ of Uand V, respectively. Further, 
if A’ is the matrix of d with respect to another pair of bases @’ and @’ of U 
and V, respectively, then A’ = P-'AQ for some invertible matrices P and 
Q. Matrices A and A’ = P~'AQ are called equivalent. 

Let U and V be vector spaces over a field F with ordered bases @ = 
(€; ,€2,...,&m) and € = (f,, f,...,f,), respectively. Let @: V —> Ube a linear 
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mapping given by 


Hf) = > Q,je;, j= 1.2.57, 
i=] 


where a,, € F. Then the m X n matrix A = (a,,) is called the matrix of the 
linear mapping ¢ with respect to the given bases B and €. Conversely, any 
m Xn matrix A = (a,,) over F determines a unique linear mapping 


Wf) = > Q;;€;, J= 1,...,7, 
im] 


of V to U whose matrix is A. 

Thus, @ @ A isa 1-1 correspondence between Hom -(V,U) and F™™". 
Indeed this correspondence @ < A preserves addition and scalar multi- 
plication; that is, if @<*A and wy B, then 6+ wA+t+B and 
ad <> aA, where a € F. Therefore we have 


6.1 Theorem. Hom ,(V,U) = F™*" as vector spaces over F. 


If V = U in Theorem 6.1, the correspondence @ = A also preserves 
multiplication. This yields that the algebra of n X m matrices over a field F 
is, abstractly speaking, the same as the algebra of linear mappings of an 
n-dimensional vector space V. We record this in 


6.2 Theorem. Hom,(V,V) = F, as algebras over F, where dim 
V=n. 


Our purpose now is to consider how a change of basis of a vector space 
affects the matrix of a linear mapping. Let @ = (e, ,é3,...,e,,) and B’ = 
(€} ,€3,....€m) be two ordered bases of U. Then each e; can be written asa 
unique linear combination of e,’s, say 


m 
ej eee > Pij&i5 J= 1,...,2M, (1) 


im} 
where p,, © F. The m X m matrix P = (p,,) is called the matrix of trans- 
formation from 8’ to &. Similarly, each e, is a unique linear combination 
of e;’s, say 


m 
g= > pyei, j= l,m, (2) 
i=] 


where p/,€ F. Then the m X m matrix Q = (pj,) is called the matrix of 
transformation from & to ®’. 
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We proceed to show that PQ = / = QP. Substituting (1) fore; in (2), we 
obtain 


e=> Pi & Put, j= l,..,m. (3) 


Because the e, are linearly independent, we equate coefficients in (3) and 
get 

> Py Pu = Ox: 

i=} 


Hence, QP=/J/. Similarly, PQ=/J. On the other hand, let @= 
(€, ,€25...5€_) be a basis of U, and let P=(p,,) be an m X m invertible 
matrix. We claim that @’ = (e},e3,...,e,,), where 


ej =: > Pijeis J = 1,...,7, (4) 


io} 
is also a basis of U. For, let 
ae; +ae,t- ta,e,,=0, aEéEF. 


Substituting for e; from (4), we obtain 
m m m 
Q, Dy Pie + A, > Pi2@t °° +a, > Dime; = 9. 
ae bad | i=} im} 


Because @, ,€2,...,€,, are linearly independent, we get 


QA) Diy + Dat °°* + AmnDPim = 9, i= 1,...,m. 


Therefore, 
4 4 
4 5} a 
P| = /=0, — so = (), 
a, a, 


because P is invertible. This proves our claim that (e{ ,e3,...,/,) 1S also a 
basis of U. 

We are now ready to prove the following theorem, which describes the 
effect of a change of a basis on the matrix of a linear mapping. 


6.3 Theorem. Let U and V be vector spaces over F of dimensions m 
and n, respectively. Let A = (a,) be the matrix of a linear mapping $: 
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V — Uwith respect to a given pair of ordered bases B = (€, ,€3,...,€m_) and 
€ = (/,,f2,.f,) of U and V, respectively. Then 


(i) The matrix of & with respect to a new pair of bases 8’ = 
(€),€3,...€,,) and 6’ = (1, f3,...,f%) of U and V, respectively, is 
P~'AQ, where P and Q are the matrices of transformations from 
8’ to B and €' to ©, respectively. 

(ti) For any given pair of invertible matrices P and Q of sizes m X m 
and n X n, respectively, there is a basis B' of U and €' of V such 
that the matrix of d with respect to B’ and @' is P~'AQ. 


Proof. (i) We have 
 (f;) = 5 Aye; Je L.gm. 


ad | 


Let A’ = (aj,) be the matrix of ¢ with respect to bases ®’ and ¢’. Then 


OUM)=Sayel, jalan. (1) 
(=| 


Since Q = (q,) and P~' = (p/,) are the matrices of transformations from 
€’ to € and B to B’, we have 


a 
{j= >> DujSes jJ= eee 
k=| 


and 
e,= > Pues, i= |,...,m. 
Hence, ~ 
dUM=d (3 awJe) = > dub fe) 
= > dy > One, 
- > Uj > Dn > Pie}. 
That is, 


df= p> ( >») Pandy) ef. (2) 


k=1 /=} 
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Because @€},€3,...,€,, are linearly independent, (1) and (2) yield 


R 7m 
a= > »> Pu ik Ary. 


k=| i=) 
Therefore, A’ = P—'AQ. 
(ii) The proof follows from the discussion preceding the theorem 
and (i). O 


6.4 Example 

Let d: F? — F? be the linear mapping given by ¢(a,,c) = (a + 6 +¢, 
b + c). Find the matrix A of @ with respect to the standard bases of F? 
and F?. Also, find the matrix A’ of @ with respect to the bases 8’ 
((— 1,0,2),(0,1,1),(3,— 1,0)) and @’ = ((—1,1),(1,0)) of F? and F?, re- 
spectively. Verify that A’ = P~'AQ, where Pand Qare asin Theorem 6.3. 


Solution 


G(1,0,0) = (1,0) = 1(1,0) + 0(0,1), 
H(0,1,0) = (1,1) = 1(1,0) + 1(0,1), 
(0,0,1) = (1,1) = 1(1,0) + 1(0,1). 


So A = (j | !). Further, 


P(— 1,0,2) = (1,2) = 2(— 1,1) + 3(1,0), 
G(0, 1,1) = (2,2) = 2(— 1,1) + 4(1,0), 
G(3,— 1,0) = (2,— 1) = — 1(— 1,1) + 10,0). 


Thus, A’ = [3 2 —}]. We now find the matnx P7!: 


(1,0) = O(— 1,1) + 10,0), 
(0,1) = 1(— 1,1) + 11,0). 


So P~' = [9 !). Similarly, 


=—[ Oo 3 
Q= 01 -! 
2 ! 0 


By actual computation, 
2 2° 1 
1 = = 4’ 
P-'AQ E 4 1 A’. 


Problems 


1. Let V be the vector space of polynomials of degree less than or 
equal to 3 over a field F. Let D be the differentiation operator on 
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V. Choose bases of V 
B, = (1,x,x*,x3) and B= (1,1 + x,(1 + x},(1 + x)’). 


(a) Find the matrix of D with respect to 8. 
(b) Find the matrix of D with respect to 2,. 
(c) What is the matrix of transformation from &, to 2,? 


2. Let V be the vector space generated by a set B of linearly 
independent differentiable functions from R to R and let D 


denote the differential operator -. Find the matrix of the 
x 
linear mapping D: V— V with respect to a basis B, where 


(a) B= {e~*,e**}, 

(b) B= {e*, xe*}, 

(c) B= {1,x,x?}, 

(d) B={1,x,e7,e7*,e7*}, 
(e) B= {sin x, cos x}. 


3. Let V = R? be the vector space over the reals. Let 7 be the linear 
mapping defined by 7(x,y) = (2x,2x + 3y). Choose 8, and 8, to 
be the standard basis and {(cos a, sin a),(— sin a, cos a@))}, respec- 
tively, of V. 

(a) Find the matrix of 7 with respect to &,. 

(b) Find the matrix of transformation from #, to #, with 
respect to the standard basis. 

(c) Find the matrix of 7 with respect to ®). 


7 Rank of a linear mapping 


In this section we introduce the concept of rank, which is of fundamental 
importance in linear algebra. It has wide applications: in particular, it 
gives the condition for consistency of a linear system of equations over a 
field, and it is a tool for reducing a matrix to canonical form. 


Definition. Let U and V be finite-dimensional vector spaces over a field F, 
and let @: VU be a linear mapping. The dimension of the subspace 
Im @ is called the rank of the linear mapping @. 


Let F" denote the set of n-tuples over F. The elements of the vector 
space F’" will be written as 1 X n or sometimes as n X 1 matrices inter- 
changeably. The ambiguity of the meaning of the notation F” should not 
cause any confusion because the context always makes it clear. 

Let A be an m X n matrix over a field F. The rows of A can be looked 
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upon as members of the vector space F”. The subspace of F” generated by 
the rows of A is called the row space of A, denoted by R(A). 


Definition. The row rank of a matrix A is the dimension of the row space 
R(A). 


We know that to each linear mapping there corresponds a class of 
equivalent matrices (Theorem 6.3). The next theorem gives a connection 
between the rank ofa linear mapping and the row rank of the correspond- 
ing class of matrices. 


7.1 Theorem. Let @: V — U be a linear mapping. Then 
rank @ = row rank A, 


where A is any matrix of o. 


Before we prove the theorem, we prove a series of lemmas. 


7.2 Lemma. Let A be an m X n matrix. Then 
R(A) = (xA|x € F™). 
Proof. Let A,,...,A,, denote the rows of A, and x =(x,,...,x,, EF”. 
Then 
A, 


XA = [xX Xe . = x,A, a of Xue. 


Am 


Thus, every element of R(A) isa linear combination of the rows of A, and, 
conversely, any linear combination of the rows of A is ofthe form xA. O 


7.3 Lemma. Let A be an m X n matrix over F. and let P be an inverti- 
ble m X m matrix over F. Then R(A) = R( PA). 


Proof. Let xA € R(A). Then xA = (xP~'\PA) € R(PA). Conversely, let 
XPA € R(PA). Then xPA =(xP)A€E R(A). O 


7.4 Lemma. Let A be an m X n matrix over F, and let Q beannXn 
invertible matrix over F. Then dim R(A) = dim R(AQ). 
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Proof. Let (u, A,...,u,A)} be a basis of R(A). Because Q is invertible, the set 
(u,AQ,...,.4,AQ) is trivially a linearly independent set of vectors in 
the vector space R(AQ). Further, if uAQ € R(AQ), then uA € R(A) 
and, hence, uA =a,u,A+ °*: +a@,u,A, a,;€ F. Therefore, uAQ= 
a,u,AQ+ °*> +a,u,AQ. This proves that {u, AQ.,...,u,A4Q) is a basis of 
R(AQ). O 


7.5 Lemma. Let A be an m X n matrix over F. Let P and Q, respec- 
tively, be m X mand n Xn invertible matrices. Then 


dim R(A) = dim R(PAQ). 


Proof. Follows by Lemmas 7.3 and 7.4. O 


7.6 Lemma. Let ¢: V-U be a linear mapping with rank $ =r. 
Then there exist bases B and B’ of V and U, respectively, such that the 
matrix of @ with respect to B and B®’ is 


oo} 


Proof. Choose a basis of Ker @ and complete it to a basis B = (v),...,v,, 
V4.1 9:++sUm) Of V, where (v,4,;,...,U,,) is a basis of Ker @ [Example 5.5(b)]. 
We claim that the list O(v,),....@(v,) is linearly independent. For if 
a, (v,) + --: +a,d(v,) = 0, then a,v, + -*- +a,v, € Ker d, so 


av, + ne + av, = Bre Ure + at + Bi Um 


for some £8, € F. Then by the linear independence of (v,,...,v,,), we obtain 
that each a, as well as each £, is zero. 

Further, since Im @ is generated by ¢(v,),....0(v,,), out of which 
b(v,+),---:O(v,,) are all zero, and the remaining 7 are linearly indepen- 
dent, it follows that {@(v, ),...,.6(v,)) is a basis of Im @ C U. Again com- 
plete the linearly independent set {(v,),....0(v,)) to a basis @’ = 


(D(v, ),---,O(U,),W1 >--W,) Of V. 
It is clear that the matrix of @ with respect to bases B and @’ is 


I, 0 
ie 


We remark that the proof of Lemma 7.6 contains the following useful 
result. 
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7.7 Rank nullity theorem. If: V— U is a linear mapping from a 
vector space V to a vector space U, then 


dim V = dim Ker @ + dim Im @. 
(dim Ker @ is called the nullity of $.) 


Proof of Theorem 7.1. Let 


I, 0 

4-[5 9] 

be a matrix of d. The assertion of the theorem is true for A. If B is another 
matrix of @ with respect to a different pair of bases of V and U, then by 


Theorem 6.3, B = PAQ for some invertible matrices P,Q. This implies 
r= dim R(A) = dim R(PAQ) = dim R(B) (Lemma 7.5). O 


Similarly, we can prove that the subspace of F” generated by the 
columns of the matrix A is also of dimension r. This subspace is called the 
column space of A. 


Definition. The rank of a matrix is defined to be the common value of the 
dimension of the row space and that of the column space of the matrix. 


Thus, the rank ofa linear mapping ¢ is the same as the rank of a matrix 
A of ¢. 

We conclude with the remark that, in practice, the computation of the 
rank of a matrix is done by performing elementary row and column 
operations on the matrix (see Chapter 20 for the computation of the rank 


of a matrix over a PID). 
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Find the rank of the linear mapping ¢: R‘ — R°, where 
0(a,b,c,d) = (a+ 2b —c+d4,—3a+ b+ 2c—d,—3a+ 8b+c+ 4). 


Solution. First we find the matrix of ¢ relative to standard bases of R‘ and 
R>,. Now 
(1,0,0,0) = (1,—3,—3) = 1(1,0,0) — 3(0,1,0) — 3(0,0,1), 
(0,1,0,0) = (2,1,8) = 2(1,0,0) + 1(0,1,0) + 8(0,0,1), 
(0,0,1,0) = (— 1,2,1) = — 1(1,0,0) + 2(0,1,0) + 1(0,0,1), 
(0,0,0,1) = (1,— 1,1) = 1(1,0,0) — 1(0,1,0) + 1(0,0,1). 
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Thus, the matrix of @ is 


lL 2. 
A=j;-3 1 2 = 
=3 l l 


Let R,,R,,and R,; denote the first, second, and third rows, respectively, of 
A. Because 3R, + 2R, — R; = 0, it follows that R,, R,, and R; are lin- 
early dependent. However, straightforward computations yield that if 
aR, + BR, =0, a, BE R, then a= 0 = £. Thus, R, and R; are linearly 
independent. Hence, the maximal number of linearly independent rows 
of the matrix A is 2, proving that rank @ = 2. 


Problems 
1. Find the rank of linear mappings ¢: R" — R” in each of the 
following cases: 
(a) n=4,m=3; 


d(a,b,c,d) 
= (2a — b+ 3c + d,a — 8b + 6c + 8d,a + 2b — 2d). 


(b) n=5,m=4, 


@(a,b,c,d,e) = (2a + 3b + c+ 4e3a+b+2c—dt+e, 
4a— b+ 3c — 2d — 2e,5a + 4b + 3c — d+ 6e). 


2. (a) LetAand Bbetwom X nmatrices over a field F. Show that 
rank(A + B) = rank A + rank B. 


(b) Let A and B be m Xn and n Xk matrices, respectively. 
Show that 


rank(AB) s min(rank A, rank B). 


[Hint: rank(AB) = row rank(AB) = dim(F”AB) s dim(F'B) 
= rank B. 


Similarly, column rank(AB) = dim(ABF‘*), etc.]} 
3. Let A and B be two n x n matrices over a field F. Show 


(a) rank(AB) > rank A + rank B—n. 
(b) If A = A’, rank A + rank(J — A) =n. 
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Let S,,S3,...,5, be subspaces of F". Show 


k 
dim(S,;NS20---AS,)> } dim S,—(k — 1)n. 
i= 


Show rank A =rank(A'A), where A is a matrix over reals. 
Generalize this result over the field of complex numbers. 
Let A be an n x n nilpotent matrix over a field F. Show 


rank A + rank A" <n. where A”™*! =0. 


PART IV 


Field theory 


CHAPTER 15 


Algebraic extensions of fields 


1 irreducible polynomials and Eisenstein criterion 


Let F be a field, and let F [x] be the ring of polynomials in x over F. We 
know that F[x] is an integral domain with unity and contains F as a 
proper subring. A polynomial f(x) in F[x] is called irreducible if the 
degree of f(x) = 1 and, whenever /(x) = g(x)h(x), where g(x),A(x) € 
F{x], then g(x) € F or h(x) € F.If a polynomial is not irreducible, it is 
called reducible. 

We remark that irreducibility of a polynomial depends on the nature of 
the field. For example, x? + | is irreducible over R but reducible over C. 


1.1 Properties of F[x} 
We recall some of the basic properties of F [x]. 


(i) The division algorithm holds in F [x]. This means that if f(x) € 
F [x] and 0 # g(x) € F[x], then there exist unique g(x), r(x) € 
F [x] such that f(x) = g(x)q(x) + r(x), where r(x) = 0 or degree 
r(x) < degree g(x). 
(11) F[x]is a PID (Theorem 3.2, Chapter 11). 
(il) F(x] 1s a UFD (Example 1.2(b), Chapter 11). 
(iv) The units of F[x] are the nonzero elements of F. 
(v) If p(x) is irreducible in F[x], then F[x}/(p(x)) is a field, and 
conversely. 


We now proceed to prove some results for testing whether a polyno- 
mial is reducible or irreducible. 
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1.2 Proposition. Let f(x) € F[x] be a polynomial of degree >1. If 
f(a) = 0 for some a € F, then f(x) is reducible over F. 


Proof. By 1.1(1) we can write f(x) = (x — «)q(x)+r, where reF. Then 
0=f(a)=r, so f(x) =(x—a)q(x). Because f(x) has degree >1, 
q(x) € F; so f(x) is reducible over F. O 


Definition. Let E be a field containing the field F, and let f(x) € F[x]. 
An element a€ E will be called a root or a zero of f(x) if f(a) = 0. 
(If f(x) = ag t+ a,x + ++: +.a,x*, then f(a) stands for the element ay + 
a,at -: +4a,0¢ in E.) 


The converse of Proposition 1.2 holds for a certain class of polyno- 
mials, as given in the following proposition. 


1.3 Proposition. Let f(x) € F[x] be a polynomial of degree 2 or 3. 
Then f(x) is reducible if and only if f(x) has a root in F. 


Proof. If f(x) is reducible, then f(x) = f,(x) (x), where f,(x) and f(x) are 
nonconstant polynomials, each of which has degree less than 3. But this 
implies that either f(x) or f,(x) must be of degree 1. Let f(x) = ax+ b, 
with a # 0. Then f,(— ba~') = 0 and, hence, f(— ba~') = 0, which proves 
that — ba™' is a root of f(x). O 


Recall that a polynomial f(x) € Z[x] is called primitive if the greatest 
common divisor of the coefficients of f(x) is 1. 


Definition. A polynomial ag + a,x + +*+ +a, x" over a ring R is called 
monic ifa, = 1. 


Clearly, every monic polynomial f(x) € Z[x] is primitive. The follow- 
ing lemma was proved earlier (Lemma 4.2, Chapter 11). 


1.4 Lemma. /ff(x), 2(x) © Z[x] are primitive polynomials, then their 
product f(x) g(x) is also primitive. 


A polynomial f(x) € Z[x] is called irreducible over Z if f(x) is an 
irreducible element in Z[x]. An irreducible polynomial over Z must be 
primitive. 
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1.5 Lemma (Gauss). Let f(x) € Z[x] be primitive. Then f(x) is reduc- 
ible over Q if and only if f(x) is reducible over Z. 


Proof. If f(x) is reducible over Z, then, trivially, f(x) is reducible over Q. 
Conversely, suppose f(x) is reducible over Q. Let f(x) = u(x)v(x) with 
u(x), v(x) € Q[x] and u(x) ¢ Q, v(x) ¢ Q. Then f(x) = (a/b)u’(x)v’(x), 
where u’(x) and v’(x) are primitive polynomials in Z[x]. Then bf(x) = 
a(u’(x)v’(x)). The g.c.d of coefficients of bf(x) is b, and the g.c.d. of the 
coefficients of au’(x)v’(x) is a, by Lemma 1.4. Hence, b = +a, so f(x) = 
+ u’(x)v’(x). Therefore, f(x) is reducible over Z. O 


Note. deg u’(x) = deg u(x) and deg v’(x) = deg v(x). 


For polynomials f(x) € Z[x] that are not necessarily primitive, we 
have 


1.6 Lemma. [ff(x) € Z[x] is reducible overQ, then it is also reducible 
over Z. 


Proof. The proof exactly parallels the proof of the corresponding result in 
Lemma 1.5. O 


If f(x) € Z[x] has a root in Q, we can prove the stronger result 


1.7 Theorem. Let f(x) = ay +a,;x+ °° +a,_,x"'+x"€ Z[x] 
be a monic polynomial. If f(x) has a root aE Q, then aE Zand aldo. 


Proof. Write a = a/fB, where a,8 € Z and (a,8) = 1. Then 
a an! a? 
ag + a,(2) i a,-1( =) + Ba = 0. 
Multiply the above equation by 8”~' to obtain 


a 
Ao B"~' + a,aB"-2 + + +a,_,a""'= Fe 


Because af € Z, it follows that a"/f € Z, so B must be +1. The last 
equation also shows aja,. Hence, a= ta€Zandala,. O 
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1.8 Theorem (Eisenstein criterion). Let f(x) ™ ay) +a,x+ °° + 
a,x"& Z[x] n=1. If there is a prime p such that p? Jao, plao, 
Pla,,....PI@n-1, DY an, then f(x) is irreducible over Q. 


Proof. Suppose 
S(X) = (Bg + Oy x + 20+ +B x cg + c,x+ °° +0,x°), 

with 5,,c,E Z, b, #0 #c,, r<n, and s <n. Then ay = bocy and a, = 
b,c,. Then since p|ay and p? J do, either plby and p J co, or pic, and p J by. 
Consider the case p|Co but p by. Because p J a,, it follows that p / b, and 
pIc,. Let c,, be the first coefficient in co + +++ +0¢,x* such that pc... 
Then note that a,, = Doc, + 0,C,—, + °** +.,,Co. From this we see that 
p J a,, (otherwise, plc,,), Som =n. Then n = ms 5 < n, which is impos- 
sible. Similarly, if p]by and p co, we arrive at an absurdity. Hence, by 
Lemma 1.6, f(x) is irreducible over Q. O 


1.9 Examples 
(a) x? — 2 is irreducible over Q. 


Solution. Follows from the Eisenstein criterion by setting p = 2. 


(b) O (x)= 1+x+ +++ +x?" is irreducible over Q, where p is 
prime. 


Solution. Write ®,(x) = (x? — 1)/(x — 1). Let 
&(x) = D(x + 1) =((x + 1? — 1)/((x + 1) — 1) 


~ L(+ (Pari a +(? .)x) 
mxrt+(P)xr4 _ +(? R 


Note that p divides all the coefficients except that of x?—', and p? does not 
divide the constant term. Thus, g(x) is irreducible over Q. Hence, ®,(x) is 
also irreducible over Q [for reducibility of ®,(x) implies reducibility of 


g(x)]. 


Problems 
1. Show that x? + 3x + 2 E€ Z/(7)[x] is irreducible over the field 
Z/(7). 


2. Show that x‘ + 8 € Q[x] is irreducible over Q. 
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Show that x3 — x — 1 € Q[x] is irreducible over Q. 

Show that x? + ax? + bx + 1 € Z[{x] is reducible over Z if and 

only if either a= bora+b=-—2. 

5. Determine all (a) quadratic, (b) cubic, and (c) biquadratic irre- 
ducible polynomials over Z/(2). 

6. Determine which of the following polynomials are irreducible 

over Q. 

(a) x?>—5x+ 10. 

(b) x4 — 3x2+9. 

(c) 2x5 — 5x4+4+5. 


~~ 


2 Adjunction of roots 


Definition. If F is a subfield ofa field E, then E is called an extension field 
of F or simply an extension of F. 


If E is an extension of F, we denote this fact by the diagram 
| 
F 


If E is an extension of F, then trivially E is a vector space over F. The 
dimension of E over F is usually written as [E: F]. 


Definition. Let E be an extension of F. Then the dimension of E considered 
as a vector space over F is called the degree of E over F. 


Therefore, the degree of E over F is written as [E: F]. If [E: F] < ©, 
then E 1s called a finite extension of F (or simply finite over F’); otherwise 
E is called an infinite extension of F. 


2.1 Theorem. Let FC ECK be fields. If [K:E]<@ and 
[E:F] < ©, then 


(i) (K:F]<o. 

(ii) (K:F]) =(K:E)(E:F]. 
Proof: Let (v, ,..-,0,) be a basis of K over E, and let (w, ,...,w,,) be a basis of E 
over F. If ue K, then 


u= Sav, a,€E. (1) 


im} 
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Because a, € E, 


a=Ybyw, ob,EF. (2) 
j=l 
Substituting the a,’s from (2) into (1), we get 


m an 
u=P Sdn, by EF. 
in| jot 
This shows that the mn elements v,w,, i = 1,...,7,/ = 1,...,m, form a set of 
generators of the vector space K over F. 
This proves (i). To prove (ii), we show that the mn elements v,w, are 
linearly independent over F. Suppose we have an equation 


Yd cyyw,=0, cyeF. 
j=] j=) 
We can rewrite this as 
m a 
> 4%=0, whered,= ScywEk, iw ,..,m. 
i=] j=l 
Because the p,’s are linearly independent over E, the last equation gives 
A, = 0, i= 1,...,7; 
that is, 
yd cyw=0, i= 1,2,...,m. 
j=l 
Because c,, € F and the w,’s are linearly independent over F, the last 
equation gives 
Cy, = 0, for i = l,....m and j = 1,...,7, 
which shows that the mn elements u,w,, i = 1,...,m,j ™ 1,...,.7, Of K form a 
basis of K over F. This proves (ii). O 


We recall that a 1-1 homomorphism of a field F into a field E is called 
an embedding of F into E. 


2.2 Lemma. Let E and F be fields, and let 0: F — E be an embedding 
of F into E. Then there exists a field K such that F is a subfield of K anda 
can be extended to an isomorphism of K onto E. 
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Proof. Let S be a set whose cardinality is the same as that of E — o(F) 
( = the complement of o(F) in E) and that is disjoint from F. Let f be a 
1-1 correspondence from S to E — o(F). Set K = F US. Then we can 
extend the embedding o: F — E to a mapping o*: K —~ E as follows: 
o*(a)=o(a) if aE F, o*(a)=f(a) if aE S. Clearly, o* is a well- 
defined, I-1, onto mapping. We now define a field structure on K. If 
x, y € K, we define 


x + y= (a*)"'(a*(x) + o*(y)), 
xy = (o*)"'(0*(x)a*(y)). 
Our definitions of addition and multiplication coincide with the given 
addition and multiplication of elements of the original field F, and it is 
clear that F is a subfield of K. Hence, K is the desired field. O 


In view of the above lemma, if a is an embedding ofa field Fin a field E, 
we can identify F with the corresponding image o(F) in E and can con- 
sider E as an extension of F. To express this identification of F with o(F), 
we write x in place of o(x) foreach x € F. Henceforth, whenever there isan 
embedding of a field F into a field E, we say that E is an extension of F. 


23 Theorem. Let p(x) be an irreducible polynomial in F(x]. Then 
there exists an extension E of F in which p(x) has a root. 


Proof. Because p(x) is irreducible in F [x], (p(x)) is a maximal ideal in 
F [x]; so E = F[x]/(p(x)) is a field. As explained in the previous para- 
graph, we identify each a € F with its coset a = a + (p(x)) and regard Eas 
an extension field of F. 

Let 


P(X) ™ ag ta,xt --° +a,x" 


Then p(x) = 0 in E; that is, ag + a,n+ -*: +a,x"=0; that is, a) + 
axt+--> +a,x"=0, where x=x+(p(x))EE; that is, agt+ 
a,x + °*+ +a,x"=0, which shows that the element x in E is a root of 
p(x) € F[x]. O 


2.4 Corollary (Kronecker theorem). Let f(x) € F[x] be a noncon- 
stant polynomial. Then there exists an extension E of F in which f(x) hasa 
root. 
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Proof. If f(x) has a zero in F, then we take E = F. Suppose f(x) has no 
zero in F. Let p(x) be an irreducible factor of f(x) in F [x] and set E = 
F[x]/(p(x)). Then by Theorem 2.3, E is an extension field of F in which 
p(x), and hence, f(x) has a root. O 


Let p(x) be an irreducible polynomial in F [x] having a root, say u, in an 
extension field E of F. We denote by F(u) the subfield of E generated by F 
and u that is, the smallest subfield of E containing F and u. Consider the 
mapping @: F[x] — E defined by 


O(by + b,x + oes + b,x") = bo + but ed + b,,u™, 


where bp + b,x + °°: + b,x" € F[x]. Obviously, @ is a homomor- 
phism whose kernel contains p(x), because p(u) = 0. We show that Ker 
dh = (p(x)). 

Because F[x] is a PID, Ker() = (g(x)) for some g(x) € F[x]. Then 
p(x) € Ker dimplies p(x) = g(x)h(x), for some h(x) € F[x]. Because p(x) 
is irreducible over F, h(x) € F. Thus, Ker @ = (g(x)) = (p(x)). 

By the fundamental theorem of homomorphisms, 


F[x)/(p(x)) = image of @ 
= {bo + bu oe 88s: b,,u™ = E|bo 
+ bxt+ 0° +5,.x"& F[x]) 
= F[u}, say. 


Because F[x]/( p(x)) isa field, the set F[u] is a field. Obviously, F[u] is the 
smallest subfield of E containing F and u, so F(u) = F[u]. If the degree of 
p(x) is n, then u cannot satisfy any polynomial in F [x] of degree less than 
n. This shows that the set 


(1, u, u?,...,4"7") 


forms a basis of F(u) over F, and [F(u): F] = n. 
We summarize these results in 


2.5 Theorem. Let p(x) be an irreducible polynomial in F[x] and let u 
be a root of p(x) in an extension E of F. Then 


(i) F(u), the subfield of E generated by F and u, is the set 
F[u] = (bp + but --+ +b,u™ E Elby + byx+ ++ +b,,x" © F[x]); 


(ii) Ifthe degree of p(x) isn, the set (1, u,....u"~') forms a basis of F(u) 
over F: that is, each element of F(u) can be written uniquely as 
Cotcyut +: +c,u"—', wherec,€ F and[F(u):F) =n. O 
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2.6 Examples 


(a) Consider the irreducible polynomial x? + 1 € R[x]. If u is a root of 
x? + 1 in some extension K of R, then the field R(u) = {a + bula,b € R) 
contains all the roots of x? + 1. 


Solution. Note that the existence of K follows from Theorem 2.3. Further, 
by Theorem 2.5, R(u) is a vector space over R and has a basis {1,u). Thus, 
R(u) = (a + bulja,b € R). Clearly, if uisa root of x? + 1, then—wisalsoa 
root. Hence, R(u) contains both the roots of x? + 1. (uis generally written 
as V— 1, and R(u) is called the field of complex numbers denoted by C.) 


(b) Consider the irreducible polynomial x? + x + 1 € Z/(2)[x]. Ifuisa 
root of p(x) in some extension K of Z/(2), show that the subfield Z/(2)[u] 
of K has four elements. 


Solution. By Theorem 2.5, elements of Z/(2)[u] are a + bu, where a,b € 
Z/(2). Thus, Z/(2)[u] = (0,1,u,1 + u), where u? + u + 1 =0, isa field of 
four elements. 


Problems 


1. Show that p(x) =x? —x-— 1 € Z/(3)[x] is irreducible over 
Z/(3). Show that there exists an extension K of Z/(3) with nine 
elements having all roots of p(x). 

2. Show that x* — 2 € Q[x] is irreducible over Q. Find (if it exists) 

an extension K of Q having all roots of x?—2 such that 

[K:Q] = 6. 

Find the smallest extension of Q having a root of x‘ — 2 € Q[x]. 

Find the smallest extension of Q having a root of x? + 4 € Q[x]. 

Let f,(x),...,/,,(x) be a set of polynomials over a field F. Show that 

there exists an extension E of F in which each polynomial has a 

root. 


aa es 


3 Algebraic extensions 


Definition. Let E be an extension of F. An element a€ E is said to be 
algebraic over F if there exist elements ao,...,a,(n = 1) of F, not all equal to 
0, such that 

Agtajat +++ +a,a"= 0. 


In other words, an element ae E is algebraic over F if there exists a non- 
constant polynomial p(x)eF [x] such that p(a«) = 0. 


290 Algebraic extensions of fields 


3.1 Theorem. Let E be an extension field of F, and let uE E be 
algebraic over F. Let p(x) € F[x] be a polynomial of the least degree such 
that p(u) = 0. Then 


(i) p(x) ts irreducible over F. 
(it) Ifg(x) © F[x] is such that g(u) = 0, then p(x)|g(x). 
(iii) There is exactly one monic polynomial p(x) € F[x] ofleast degree 
such that p(u) = 0. 


Proof. (i) Let p(x) = p,(x)p2(x), and deg p,(x), deg p.(x) be less than 
deg p(x). Then 0 = p(u) = p,(u) * p2(u). This gives p,(u) = 0 or p,(u) = 0; 
that is, u satisfies a polynomial of degree less than that of p(x), a contra- 
diction. So p(x) is irreducible over F. 

(11) By the division algorithm g(x) = p(x)g(x) + r(x), where r(x) = Oor 
deg r(x) < deg p(x). Then g(u) = p(u)g(u) + r(u); that is, r(u) = 0. Be- 
cause p(x) is of the least degree among the polynomials satisfied by u, r(x) 
must be 0. Thus, p(x)|g(x). 

(111) Let g(x) be a monic polynomial of least degree such that g(u) = 0. 
Then by (ii) p(x)|g(x) and g(x)| p(x), which gives p(x) = g(x) since both 
are monic polynomials. O 


Definition. The monic irreducible polynomial in F(x] of which u is a root 
will be called the minimal polynomial of u over F. 


Definition. An extension field E of F is called algebraic ifeach element of E 


is algebraic over F. 
Extensions that are not algebraic are called transcendental extensions. 


3.2 Theorem. If E is a finite extension of F, then E is an algebraic 
extension of F. 


Proof. Letu € Eand[E: F) = 72. Then (1,...,u"} must be a linearly depen- 
dent set of elements of E over F. Thus, there exist ay ,@, ,...,4, (not all zero) 
in F such that ap + a,u+ °:- +a,u"=0. Thus, u is algebraic over F. 
This proves the theorem. O 


As a consequence of Theorems 3.2 and 2.5, we have 


3.3 Theorem. Jf E is an extension of F andu & E is algebraic over F, 
then F(u) is an algebraic extension of F. 
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Proof. If p(x) is the minimal polynomial of u over F and degree p(x) is n, 
then by Theorem 2.5, [F(u):F] =n < oo. Therefore, by Theorem 3.2, 
F(u) is an algebraic extension of F. O 


Not every algebraic extension is finite, as the following example shows. 


3.4 Example 


Consider the field E = Qcv2, V3....,Vp,...), the smallest subfield of the field 
of real numbers containing the rationals Q and the square roots of all the 
positive primes. We claim that 


Qc Q(v2) Cc Q(v2,V3) Cc +: 


is an infinite properly ascending chain of subspaces over Q. In 
other words, each new adjunction of the square root of a prime is a 
proper extension. Let p,,...,.p, be m distinct positive primes. Set F 
Q(vp,,....¥p,). If is any other prime not equal to any p,, i = 1,...,”, then 
we show un ¢ F. We prove this by induction on n. Letn = 0. Then F = Q. 
Then by a proof analogous to the classical proof that 12 ¢ Q, we can show 
vq ¢ Q, so the result holds for n = 0. 

Assume now that the result is true for 7 — 1 primes and suppose that 
F =Q(vp,,....VD,-1, Vp,). Set Fo = we ,-..,VD,-1). By induction the 
result holds for Fy; in addition, F = Fo(vp,) is an extension of Fy of degree 
2. If possible let Vg © F. Then 


Vq=atbvp,, a,b€ Fo. 
This gives 
q= a? + bp, + 2abvp,, 


which implies that Vp, € Fp, a contradiction. Hence, vq ¢ F, as asserted. 
Therefore, 


Qc Q(Vv2) C Q(V2,V3) C + 


is an infinite properly ascending chain. 

Hence, E = Q(v2,V3....,Vp,...) is not a finite extension of Q. Further, if 
a€ E, then there exists a positive integer r such that-a € Q(vp,, 
vp,,...,¥p,), which is clearly a finite extension (in fact of degree 2’) of Q. 
Thus, @ is algebraic over Q. Hence, E is an algebraic extension of Q, and 
[E:Q] is infinite. 
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Definition. An extension E of F is called finitely generated if there exists a 
finite number of elements u, , Uz,...,U, in E such that the smallest subfield of 
E containing F and (u, ,u,...,U,) is E itself. We then write E = F(uy,...,U,). 


A finitely generated extension need not be algebraic. 


3.5 Example 


Let F [x] be a polynomial ring over a field F in a variable x. Consider the 
field of quotients E of F [x]. The elements of E are of the form 


(Ag tax --* +4,,X%™)/(byp + Ox + --* +5,x"), 


where a,, b, € F and not all b, are zero. Thus, E is generated by x over F: 

that is, E = F(x). Clearly, by the definition of a polynomial ring, x cannot 

be algebraic over F. Hence, E is not an algebraic extension. 
However, we have the following 


3.6 Theorem. Let E = F(u,,...,u,) be a finitely generated extension of 
F such that each u,, i = 1.,...,r, is algebraic over F. Then E is finite over F 
and, hence, an algebraic extension of F. (The case for r = | was proved in 
Theorem 3.3.) 


Proof. Set E, = F(u,,...,4,), | Ss is r. Observe that if an element in E is 
algebraic over a field F, then, trivially, it is algebraic over any field K such 
that E > K > F. Therefore, each u, is algebraic over E,_,, i = 1,...,7, with 
E, =F. Also, E, = E;- ,(u,). Therefore, by Theorem 2.5, [E,:E,-~ ,] 1s 
finite, say n;. By Theorem 2.1, 

[E: F] = (E: FE, ,J[E,-1:£,-2) °** (E,:F]; 
hence, 

[E:F] =n,n_, °** ny. 
Thus, E is a finite extension of F and therefore algebraic over F. O 


An application of Theorem 3.6 gives 


3.7 Theorem. Let E be an extension of F. If K is the subset of E 
consisting of all the elements that are algebraic over F, then K is a subfield 
of E and an algebraic extension of F. 
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Proof. We need only show that if a,b € E and are algebraic over F, then 
a + b, ab, and a/b (if b # 0) are also algebraic over F. This follows from 
the fact that all these elements lie in F(a,5), which, by Theorem 3.6, is an 
algebraic extension of F. 

Thus, K is an algebraic extension of Fin E. O 


The subfield K in Theorem 3.7 is called the algebraic closure of F in E. 
In the next section we define an algebraically closed field as one that has 
no proper algebraic extensions, and we show that for any given field F 
there exists a unique (up to isomorphism) algebraic extension F that is 
algebraically closed. This F is also called the algebraic closure of F, with- 
out any reference to its being in another field such as the field K already 


defined. 
We next prove an important result that we often use in the following 


sections. 


Definition. Let K and L be extension fields of a field F. Then a nonzero 
homomorphism (hence an embedding) 0: K — L, such that o(a) = a for 
all ae F, is called an F-homomorphism of K into L or an embedding of 


K in L over F. 


The above terminology is consistent with the earlier definition of an 
F-homomorphism of vector spaces, for we can regard K and L as vector 
spaces over F, and then if o is a nonzero homomorphism of K into L (as 
fields) with o(a) = a for all a € F, then o(ax) = a(a)o(x) = aa(x). 


3.8. | Theorem. Let E be an algebraic extension of F, and leto: E— E 
be an embedding of E into itself over F. Then o is onto and, hence, an 
automorphism of E. 


Proof. Let a € E, and let p(x) be its minimal polynomial over F. Let E’ 
be the subfield of E containing F and generated over F by all roots of p(x) 
that lie in E. Thus, E’ is generated over F by a finite set of elements in E 
that are algebraic over F. So by Theorem 3.6, E’ is a finite algebraic 
extension of F. Furthermore, o must map a root of p(x) onto a root of p(x) 
and, hence, maps E’ into itself. Now o(E’) = E’, because a is 1-1. Thus, 
[o(E’): F] = [E’:F]. But since o(E£’) is a subspace of E’, it follows that 
o(E’) = E’. This implies that there exists 6 € E’ such that o(5) = a. 
Hence, a is onto, which completes the proof of the theorem. O 
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Problems 


1. 
2. 


10. 


12. 


Let FC KC Ebe three fields such that Kis an algebraic extension 
of Fanda € E isalgebraic over K. Show that a is algebraic over F. 
Prove that ¥2 and V3 are algebraic over Q. Find the degree of 


(a) Q(,/2) over Q. 

(b) Q(,/3) over Q. 

(c) Q(,/2, /3) over Q. 

(d) Q(./2+./3) over Q. 

Determine the minimal polynomials over Q of the following 
numbers: 


(a) /24+5. (b) 3,/2+5. () V-14+,/2. 
(d) /2-—3,/3. 


Find a suitable number a such that 

(a) Q(./2,./5)=Q(a). (b) Q(./3, 1) = Q(a). 

Let E be an extension of F, and let a,b € E be algebraic over F. 
Suppose that the extensions F(a) and F(d) of F are of degrees m 
and n, respectively, where (7,n) = 1. Show that [F(a,b):F] = 
mn, 

If E is an extension field of F and [E: F] is prime, prove that there 
are no fields properly between E and F. 

Let E be an extension field of F. If a € E has a minimal polyno- 
mial of odd degree over F, show that F(a) = F(a?). 

Let x” — a € F[x] be an irreducible polynomial over F, and let 
bE K be its root, where K is an extension field of F. If m is a 
positive integer such that m|n, find the degree of the minimal 
polynomial of 5” over F. 

Let F be a field, and let F(x) be a field of rational functions. Let 
L > Fbeasubfield of F(x). Prove that F(x) is an algebraic exten- 
sion of L. 

Give an example of a field E containing a proper subfield K 
such that E is embeddable in K and [E:K] is finite. 


Let wyatog + isin an and u= ee. 
n n n 
Show that [Q(w):Q(u)] =2. 


Let D be an integral domain and F be a field in D such that 
[D:F] < oo. Prove that D is a field. 


Algebraically closed fields 295 
4 Algebraically closed fields 


Definition. A field K is called algebraically closed if it possesses no proper 
algebraic extensions - that is, if every algebraic extension of K coincides 
with K. 


4.1 Theorem. For any field K the following are equivalent: 


(i) K is algebraically closed. 
(ii) Every irreducible polynomial in K(x] is of degree }. 
(iti) Every polynomial in K(x] of positive degree factors completely in 
K[x] into linear factors. 
(iv) Every polynomial in K[x] of positive degree has at least one root 
in K. 


Proof. (i) = (ii) Let p(x) € K[x] be an irreducible polynomial of degree 7. 
Then by Theorems 2.3 and 2.5, there exists a finite (hence algebraic) 
extension E of K such that [E: K ] = n. But since K is algebraically closed, 
E=K,.son= 1. 

(ii) => (i) Let E be any algebraic extension of K, and let a € E. Then the 
minimal polynomial of a over K is irreducible and, therefore, has degree 1}. 
Therefore, a € K and, hence E = K. The equivalence of (ii), (iii), and (iv) 
is obvious. O 


An example of an algebraically closed field is the field of complex 
numbers. Note that the algebraic closure of the field of real numbers R in 
the field of complex numbers C is C itself. 


Definition. /f F is a subfield of a field E, then E is called an algebraic 
closure of F if 


(i) Eis analgebraic extension of F. 
(ii) E is algebraically closed. 


The following is an easy fact, and its proof is left as an exercise. 

If F is a subfield of an algebraically closed field K, then the algebraic 
closure F of F in K is also algebraically closed. 

Next we show that any two algebraic closures of a field F (if they exist) 
are isomorphic under a mapping that keeps each element of F fixed. The 
proof of the existence of an algebraic closure F of F is given at the end. 

We first prove a key lemma. 
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4.2 Lemma. Let F be a field, and let o: F — L be an embedding of F 
into an algebraically closed field L. Let E = F(a) be an algebraic exten- 
sion of F. Then o can be extended to an embedding n: E — L, and the 
number of such extensions is equal to the number of distinct roots of the 
minimal polynomial of a. 


Proof. Let p(x) = dy + a,x + ++ +x" be the minimal polynomial of a 
over F. Let 


p*(X) = a(ag) + o(a,)x + +> +x"E Lx]. 


Let BS be a root of p°(x) in L. Recall that if is algebraic over a field F, then 
a typical element of the field F(a) can be written uniquely as by + 
bat +: +b,a*, where k < degree of the minimal polynomial of a 
over F, and 6, € F, i = 1,,...,k. 

Define 7: F(a) — L by the rule 


n(dy + bya + +++ + byar*) = a(bp) + 0(b,)B + +++ + 0(d,)B". 


Then nis a well-defined mapping. Routine computation shows that 7 is 
a homomorphism. Thus, 7 is an embedding of F(a) into L, and it extends 
ao. Clearly, there isa 1-1 correspondence between the set of distinct roots of 
p*(x) in L (hence, between the set of distinct roots of p(x) in its splitting 
field over F) and the set of embeddings 7 of F(a) into L that extend o. This 
proves the last assertion. O 


43 Theorem. Let E be an algebraic extension of a field F, and let a: 
F — L be an embedding of F into an algebraically closed field L. Theno 
can be extended to an embedding n: E — L. 


Proof. Let S be the set of all pairs (K,@), where K is a subfield of E 
containing F, and @ is an extension of o to an embedding of K into L. If 
(K,@) and (K’,@’) are in S, we write (K,@) s (K’,@’) if KC K’ and @’ 
restricted to K is 8. Because (F,a) € S, S # ©. Also, if ((K,,0,)) isa chain in 
S, we set K = UK, and define 0 on Kas follows. Let a € K. Thena € K, for 
some i, and we define {a) = 6,(a). @ is well defined. Let a € K, and 
a & K,. Because either K, C K,; or K; C K, by definition ofa chain in S, we 
get 0,(a) = 8a). Hence, 6 is well defined. Then (K,@) is an upper bound 
for the chain {(K,,6,)}. Using Zorn’s lemma, let (K,7) be a maximal ele- 
ment in S. Then 7 is an extension of a, and we contend that K = E. 
Otherwise, there existsa € E,a ¢ K. Then by Lemma 4.2 the embedding 
n. K — Lhasan extension 7*: K(a) — L, thereby contradicting the maxi- 
mality of (K,n). Hence, K = E, which proves the theorem. O 
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We are now ready to show that any two algebraic closures of a field are 
isomorphic. 


4.4 Theorem. Let K and K’ be algebraic closures of a field F. Then 
K = K’ under an isomorphism that is an identity on F. 


Proof. Let A: F — K be the injection; that is, A(a) = a for all a € F. By 
Theorem 4.3, A can be extended to an embedding A*: K’ — K. Now 
K’ = A*(K’). Hence, A*(K’) is also an algebraically closed field containing 
F. Because K is an algebraic extension of F, K is also an algebraic exten- 
sion of A*(K’), which lies between F and K. But then A*(K') = K, so J* is an 
isomorphism of K’ onto K, as desired. O 


Theorem 4.4 shows that an algebraic closure of a field F is unique up to 
isomorphism. Henceforth, we denote by F the algebraic closure of F. 

Now we want to show the existence of an algebraically closed field K 
containing a given field F and algebraic over it. The proof of this result, as 
given in the following theorem, is due to Artin and needs the concept of a 
polynomial ring in an infinite number of variables. There are other 
proofs, but all of them need some additional concepts that are not dis- 
cussed in this book. For the sake of completeness we have chosen to give 
the proof by Artin. For those who find the proof involved, they may skip it 
at the first reading; this will not impede their understanding any of the 
following material. 

We first outline the concept of a polynomial ring in an infinite number 
of variables over a field F. 

Let F be a field, and let S = (x,),<, be an infinite set of commuting 
indeterminates or variables. Then the elements of the form 


DXi, Xi, 7 Xi «= aE FLX, ES, 
finite 
with natural addition and multiplication form a ring F[S)} called the 


polynomial ring over F in S. Note that for a polynomial 2a,x;,,x;,---x;,, to 
be zero each coefficient a, of each monomial x,, --- x, must be zero. 


4.5 Theorem. Let F be a field. Then there exists an algebraically 
closed field K containing F as a subfield. 


Proof. Let us first construct an extension K of Fin which every polynomial 
in F [x] of degree = | has a root. Let S be a set in 1-1 correspondence with 
the set of all polynomials in F[x] of degree = 1, and suppose to each 
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polynomial f= /(x) € F[x] of degree = 1 we correspond the element 
x a S. 

Form the polynomial ring F[S]. We claim that if A is the ideal in F[S] 
generated by all polynomials /(x,) of degree = 1 in F[S], then A # F[S]. 
Otherwise | € A, and, hence, 


BAX) + + +enS(x,)=1, with g, € F[S). (1) 


Now 2;,...,2, Will involve only a finite number of variables. Write 
Xy, =X, for each f, € F[x]. After reindexing, we can assume that x, = 
Xj y..,Xy = X,, and the variables occurring in all g,, 1 s/s n, are in the 
Set (X,,Xp,....X poe Xm} Ehus, we can rewrite (1) as 


») BX peer Xm f(%)) = 1. (2) 


Let E be an extension of F in which each polynomial f,,..., f, has a root 
(Problem 5, Section 2). Let a, be aroot offin E, i = 1,...,. Put x, = a, for 
i= 1,.....and x,=0 fori=n + l,...,m in relation (2) and obtain 0 = I, 
which is absurd. Thus, A * F[S]. Then by Zorn’s lemma, A can be em- 
bedded in a maximal ideal M of F[S]. It is easy to see that F[S]/M is a field 
containing a copy of F. Hence, F[S]/M/ can be regarded as an extension of 
F. Also, each polynomial f€ F[x] of degree = | has a root in F[S]/M. 
Therefore, we have constructed a field K,, namely, F[S]/M, that is an 
extension of F and in which every polynomial f(x) € F[x] of degree 2 | 
has a root. 

Inductively, we can now form a chain of fields K, C K,C K;C... 
such that every polynomial in K,[x] of degree = | hasarootin K,,,. Let 
K = Us_,K,. Then K is a field, and every polynomial in K[x] has its 
coefficients in some subfield K,, and, hence, has a root in K,,, C K. This 
completes the proof. O 


4.6 Theorem. Let F be a field. Then there exists an extension F that is 
algebraic over F and is algebraically closed; that is, each field has an 
algebraic closure. 


Proof. Let K be an algebraically closed field containing F. If Fis the set of 
all elements of K that are algebraic over F, then, by Theorem 3.7, F is an 
algebraic extension of F. Let /(x) € F[x]. Then f(x) has a root a€ K 
because K is algebraically closed. But then a € K is algebraic over F, and 
because F is algebraic over F, we obtain, by Problem | in Section 3, that 
ais algebraic over F. Hence, a € F. Thus, F is algebraically closed, which 
proves that F is an algebraic closure of F. O 
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It is proved in Chapter 18 that every polynomial in C[x] has a root in C 
(fundamental theorem of algebra). This is equivalent to saying that C = 
C. Further, the tower 


C 
I 
R 
| 
R 


with [C:R] = 2 shows that R = C. The algebraic closure Q of Q in C is 
called the field of algebraic numbers. Q is a countable set, so it is strictly 
contained in C (Theorem 5.3, Chapter 1). 


CHAPTER 16 


Normal and separable extensions 


] Splitting fields 


Definition. Let f(x) be a polynomial in F [x] of degree = 1. Then an exten- 
sion K of F is called a splitting field of f(x) over F if 


(i) f(x) factors into linear factors in K[x]; that is 
f(x) = (x — a) (x—@,), a, EK. 


(ii) K= F(q,,....@,); that is, K is generated over F by the roots 
Q,,.-.,, Of f(x) in K. 


For example, (i) the field Q(V2) = {a + bV2|a,b € Q) isa splitting field 
of x? — 2 € Q[x] over Q; (ii) a splitting field of x? + 1 € R[x] over R is the 
field C. 

We note that a polynomial f(x) € F[x] always has a splitting field, 
namely, the field generated by its roots in a given algebraic closure F of F. 


1.1 Theorem. If K is a splitting field of f(x) € F[x] over F, then K isa 
finite extension and, hence, an algebraic extension of F. 


Proof. Because K = F(q,,...,@,) and each a@,, / = 1,...,”, is algebraic over 
F, the proof follows from Theorem 3.6 of Chapter 15. O 


1.2 Theorem (uniqueness of splitting field). Let K be a splitting field 
of the polynomial f(x) € F [x] overa field F. If E is another splitting field of 
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J (x) over F, then there exists an isomorphism o: E — K that is identity on 
F. 


Proof. Let K be an algebraic closure of K. Then K is algebraic over K. 
Because K is also algebraic over F, it follows that K is algebraic over F. 
Hence, K = F. Because E is an algebraic extension of F (Theorem 1.1), 
we get, by invoking Theorem 4.3 of Chapter 15, that the identity mapping 
4: F — F can be extended to an embedding o: E — K. Let f(x) =a) + 
a,xt+e1+ +a,x" & F[x]. Set f?(x) = a(ao) + o(a,)x + +++ + 0(a,)x". 
Because a is identity on F, f°(x) = f(x). Now we have a factorization 


L(x) = C(x — 01) +++ (X — Oe) 
with a; € E, i = 1,...,2, c € F. From the fact that f(x) = /?(x) and c € F, 
we obtain 

I(x) = C(x — a(a,)) ++ (x — o(@,)), 


a unique factorization in K [x]. But since f(x) has a factorization in K[x]. 
Say 


I(x) = c(x — Bi) ++ (x — B), 
where £,€ K, i= 1.,...,n, it follows that the sets {a(@,),...,0(@,)} and 
{ B,,...,8,) are equal. Thus, 

K = F(B,,....8,) = F(o(Q,),.-.,0(Q@,)) = O( F(a, ,...,,)) = O(E). 


Hence, ¢ is an isomorphism of EF onto K. O 


Theorem 1.2 proves that the splitting field of a polynomial over a given 
field is unique (up to isomorphism) if it exists. But recall that any field F 
has an algebraic closure F that contains roots of all polynomials over F. 
Thus, the intersection of all subfields of F containing all the roots of a 
given polynomial f(x) € F[x] is the splitting field of f(x) over F. 


1.3 Examples 
(a) The degree of the extension of the splitting field of x? — 2 € Q[x] is 6. 


Solution. By Eisenstein’s criterion x? — 2 € Q[x] is irreducible over Q, 
and indeed it is the minimal polynomial of 2'7. Thus, Q[x]/(x? — 2) = 
Q(2'/7) with [(Q(2'3):Q] = 3. Clearly, Q(2'/3) is not the splitting field 
of x? — 2 € Q[x], since x3 — 2 = (x — 2'4)\(x? + 2'3x + 27/3) implies 
x3 — 2 has two complex roots, say a and a. Thus, p(x) = x? + 2'3x + 


302 Normal and separable extensions 
27/3 € Q(2"3)[x] is irreducible over Q(2'/?). Hence, 
Q(2")[x]/(p(x)) = Q(2'? a) = Q(2"?,a), 


and the degree of extension of Q(2'3,a) over Q(2'/?) is 2, the degree of 
p(x). Because Q(2!3,a) contains one root a of p(x), it will also contain the 
other root a. Hence, Q(2',q) is the splitting field of x? — 2 € Q[x] over 
Q. Finally, 


[(Q(2'4,a): Q] = Side Q(2/7)}[Q(2'?):Q] 


This completes the solution. We exhibit the bases of extensions in the 
following diagram: 


Q(2'8 a) Q(2'3 a) 
| basis {1, a} 
Q2!/3) basis (1, 2'/3, 22/3, 
| basis (1, 2!/3, 27/3) 2 "/3 | 22/3) 
Q Q 


(b) Let p be prime. Then f(x) = x? — 1 € Q[x] has splitting field Q(a), 
where a # | and a? = 1. Also, [(Q(a): Q] = p— 1. 


Solution 

S (x) = (x — W)(xPT + xem 2 + ee tx + 1), 
We know 

P(x) = xP + +) +x4+1 E Q[x] 


is irreducible over Q. Let a be a root of p(x) in the splitting field of f(x) 
over Q. Then, clearly, a? = | anda ¥ 1. Weassert that 1,a,a@?,...,@°7! are 
p distinct roots of f(x). Clearly, a? = 1 implies (a’ = 1 for all positive 
integers i. Thus, we need to show that 1,a,a?,....a°~' are distinct roots. 
Note that if m is the smallest positive integer such that a” = 1, then m|p. 
Thus, m = p. Hence, no two roots in the list 1,a,...,@°~' can be equal, 
whence these are all the p roots of x? — 1. Hence, the splitting field of 
x? — 1 € Q[x] is Q(a). Because the minimal polynomial of @ is p(x), 
which is of degree p — 1, we get [Q(a): Q] = p — 1, which completes the 
solution. 


We prove later that the degree of extension of the splitting field of 
x" — 1 E Q[x] is d{n), where @ is Euler's function. 
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(c) Let F = Z/(2). The splitting field of x? + x? + | € F[x] isa finite 
field with eight elements. 


Solution. Because x} + x? + 1 has degree 3, we know this is reducible if 
and only if it has a root in F. But by actual substitution neither 0 nor | isa 
root. Thus, x? + x? + 1 is irreducible over F. Let a be a root of this 
polynomial in its splitting field. Then we find 


x34 x24 1 = (x + ax? + (1 +a)x + (a+ a?)) 
=(x+aXxt+a2Xx+1+a+ a’). 


Therefore, F(a) is the splitting field of x3>+x?2+1 over F, and 
[F(a): F] = 3, the degree of the minimal polynomial x? + x? + | of a. 
Furthermore, F(a) has a basis (1,a@,a@?} over F. Therefore, 


F(a) = {0,1,a,a2,a + 1,a? + 1a? +a,a?+ a+ 1}, 
where a? + a? + 1 =0. This completes the solution. 


(d) The splitting field of f(x) = x4 — 2 € Q[x] over Q is Q(2'",i), and 
its degree of extension is 8. 


Solution. f(x) = x4 — 2 € Q[x] is irreducible over Q. We find f(x) has a 
root 2'/4. So f(x) is the minimal polynomial of 2'“ over Q. Thus, 


[Q(2'*): Q) = 4, the degree of f(x). 


Now f(x) = (x — 2"/4)(x + 2'/4)(x? + 2") and x? + 2'/? is irreducible 
over Q(2'/*). Thus, the root 2!“ i has x? + 2'/ as its minimal polynomial 
over Q(2'/*), whence 


[Q(2'*)(2"/47): Q(2"*)] = 2. 
Because Q(2'/4)(2'/47) = Q(2'/4,/) is clearly the splitting field and 
[Q(2"4,i): Q) = [(Q(2",/): Q(2"*)Q(2"*): Q) = 4° 2 = 8, 
we have completed the solution. We illustrate it in the following diagram: 


Q(2!/4 i) 
basis (1, 2!/4 ;) 
| basis (1, 2'/4, 21/2, 23/4) 
Q 
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Problems 


1. Construct splitting fields over Q for the following polynomials. 
(a) x?—1. (b) x*4+1. (c) x®—1. (d) (x?—2)(x3—3). 
Also find the degrees of extension over Q. 

2. Construct a splitting field for x° + x + 1 € Z/(2)[x] and list all 
its elements. 

3. Find conditions on a and b such that the splitting field of x3 + 
ax + b E Q[x] has degree of extension 3 over Q. 

4. In Problem 3 find conditions on a and b such that the degree of 
extension is 6 over Q. 

5. Let E be the splitting field of a polynomial of degree n over a field 
F. Show that [E: F] s n! 

6. Let f(x) € F[x] be a polynomial of degree n = | over F. Show 
that f(x) has exactly n roots in its splitting field. (A multiple root 
occurring with multiplicity m is counted m times.) 

7. Ifan irreducible polynomial p(x) over a field F has one root in a 
splitting field E of a polynomial f(x) € F[x], then p(x) has all its 
roots in E. 

8. Show that over any field K > Q the polynomial x? — 3x + 1 is 
either irreducible or splits into linear factors. 

9. Let f(x) = x* — 2x? —2eQ[x]. Find the roots «, 8 of f(x) such 
that 


Q(a) = Q(A). 
What ts the splitting field of f(x)? 


2 Normal extensions 


Let (f;(x)),<, be a family of polynomials of degree = | over a field F. In the 
previous section we defined the splitting field of a polynomial over F. 
Now by a splitting field of a family (/;(x)),-, of polynomials we mean an 
extension E of F such that every /,(x) splits into linear factors in E[x], and 
E is generated over F by all the roots of the polynomials f(x), i € A. If Ais 
finite and our polynomials are /,(x),..., /,(x), then their splitting field is a 
splitting field of the single polynomial f(x) = /\(x) -:: f,(x), obtained by 
taking the product. The proof of uniqueness (up to isomorphism) of a 
splitting field ofa single polynomial can be extended to prove the unique- 
ness (up to isomorphism) of a splitting field of a family of polynomials 
over a given field. 

The next theorem proves a set of equivalent statements for an exten- 
sion E of F to be a splitting field of a family of polynomials over F. 
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2.1 Theorem. Let E be an algebraic extension of a field F contained in 
an algebraic closure F of F. Then the following conditions are equivalent: 


(i) Every irreducible polynomial in F(x] that has a root in E splits 
into linear factors in E. 
(ii) Eis the splitting field of a family of polynomials in F [x]. 
(iii) Every embedding o of E in F that keeps each element of F fixed 
maps E onto E. (In other words, o may be regarded as an auto- 
morphism of E.) 


Proof. (i) = (11) Leta € E, and let p,(x) be its minimal polynomial over F. 
By (i), p,(x) splits into linear factors in E. Thus, it follows immediately 
that EF is a splitting field of the family (p,(x)), a € E. 

(11) => (ii) Let (/,(x)), i € A, be a family of polynomials of which E is 
the splitting field. Ifa isa root ofsome/,(x) in E, then for any embedding o 
of E into F that keeps each element of F fixed, we know o(q) is a root of 
f(x). Because E is generated by the roots of all the polynomials f(x), it 
follows that o maps E into itself. Thus, by Theorem 3.8 of Chapter 15, ais 
an automorphism of E. 

(i11) = (i) Let p(x) € F [x] be an irreducible polynomial over F that has 
a root a€ E. Let BE F be another root of p(x). We show that BE E. 
Because a and £ are roots of the same irreducible polynomial p(x), we 
have F-isomorphisms 


F(a) = F[x]/(p(x)) = F(B). 


Let o: F(a) — F(B) be the isomorphism given above. Then o(a) = £ 
and o(a)= a for all ae F. By Theorem 4.3 of Chapter 15, o can be 
extended to an embedding o*: E — F. But then, by (iii), o* is an auto- 
morphism of £; hence, o*(a) = o(a)=£8E E. This completes the 
proof. O 


Definition. An extension E of a field F is called normal if E satisfies any 
one of the equivalent statements of Theorem 2.1. 


2:2 Examples of normal extensions 


(a) C is a normal extension of R. 

(b) R is not a normal extension of Q, for x? — 2 € Q[x] is irreducible 
over Q and has a root V2 in R, but it does not split into linear factors in R 
because it has complex roots. 

(c) Ifa = cos(z/4) + i sin(7/4), then Q(aq) is a normal extension of Q. 
This follows from the fact that Q(q) is the splitting field of x* + 1 © Q[x]. 
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(d) In general, any extension E of a field F, such that [E: F] = 2, is 
a normal extension. Let a € E, a ¢ F. Let p(x) be the minimal poly- 
nomial of q@ over F. Then [F(a):F] = degree of p(x). Because 
[E: F(a)|[F(a):F] =[E:F] =2, we must have [E:F(a)]=1 and 
[F(a):F] = 2. Thus, E = F(a), and the degree of p(x) is equal to 2. Be- 
cause p(x) has one roota € E, it must have its other root also in E. Hence, 
E is the splitting field of p(x) € F[x] and is thus a normal extension of F. 


2.3 Example 


Let E be a finite extension of F. Then E is a normal extension of F if and 
only if E is a splitting field of a polynomial f(x) € F[x]. 


Solution. By hypothesis E = F(a,,...,@,), where a, € E are algebraic over 
F. Let px) be the minimal polynomial of a, over F. 

Assume first that E is a normal extension of F. Then p,(x) splits in E 
because it has one root a, € E. Thus, f(x) = p,(x) °°: p,(x) € F[x] has all 
roots in E. Because E = F(q,,...,@,) and @, ,...,@, are some of the roots of 
J(x), E must be the splitting field of f(x). The converse follows from 
Theorem 2.1. 


Problems 
1. Which of the following extensions are normal over Q? 


(a) Q(./-2).  (b) Q6/7). © QA/=D. 

(d) Q(x), where x is not algebraic over Q. 

Is R(V—5) normal over R? 

Let E be a normal extension of F and let K be a subfield of E 

containing F. Show that E is a normal extension over K. Give an 

example to show that K need not be a normal extension of F. 

4. Let F=Q(V2) and E = Q(V2). Show that E is a normal exten- 

sion of F, F is a normal extension of Q, but E is not a normal 

extension of Q. 

Show that every finite extension of a finite field is normal. 

6. Let E,,ieA, be a family of normal extensions of a field F in some 
extension K of F. Show that () E, is also a normal extension 


of F. me 

7. Show that the field generated by a root of x° — x — 1 over Q is 
not normal over Q. 

8. Find the smallest normal extension (up to isomorphism) of 
Q(2!/*) in Q (the algebraic closure of Q). 


wn 
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9. Find the smallest normal extension (up to isomorphism) of 


Qc2!/4 31/4) j in QO. 
10. Let F,E, and K be fields such that F © E ¢ K and K is normal 
over F. Show that any F-homomorphism 


co: E-K 


can be extended to an F-automorphism of K. 
11. Let Ec F and KcF be normal extensions of a field F. Show 
that the subfield L generated by E and K is also normal over F. 


3 Multiple roots 


The purpose of this section is to discuss the multiplicity of roots of a 
polynomial over a field. For this purpose we introduce the concept of the 
derivative of a polynomial. 

Let f(x) = 27.,a,x' be a polynomial over a field F. We define the 
derivative of f(x) by f’(x) = 27, ia,x‘—". Then properties of derivatives 
that are familiar from calculus are not necessarily valid here. For example, 
J’ (x) = 0 does not always imply that f(x) is a constant: for example, if we 
set f(x) = x? in a field of characteristic 5 then /’(x) = 5x‘ = 0. 

The ordinary rules for operating with derivatives, however, remain the 
same. It 1s easy to verify that taking the derivative is a linear operation; 
that 1s, we have 


3.1 Theorem. (af(x) + bg(x))’ = af’(x) + bg’(x), where a,b & F. 


For the derivative of a product we have the usual rule. 


3.2 Theorem. (/(x)g(x))’ =f’(x)g(x) + S(x)g’(x). 


The proof follows easily from the definition of a derivative. 

Let K be a splitting field of a polynomial f(x) € F [x]. Let a be a root of 
J (x). Then (x — a@)| f(x) in K[x]. If (x — a)‘ is the highest power of (x — a) 
that divides f(x) in K[x], then sis called the multiplicity of a. Ifs = 1, then 
a 1s Called a simple root; if s> 1, then « is called a multiple root. 


3.3 Theorem. Let f(x) € F[x] bea polynomial of degree = 1 withaas 
a root. Then a is a multiple root if and only if f’(a) = 
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Proof. Because a is a root of f(x), by the division algorithm we can write 


S(x) = (x — adg(x). 


Then {’(x) = (x — a)g ’(x) + g(x). Clearly, a is a multiple root of (x) if 
and only if g(a) = 0. Because /’(a~) = g(a), the proposition follows. O 


3.4 Corollary. Let f(x) be an irreducible polynomial over F. Then f(x) 
has a multiple root if and only if f(x) = 0. 


Proof. By Theorem 3.3, ais a multiple root of f(x) ifand only if it is a root 
of f’(x). Because f(x) is irreducible, a7 'f(x) is the minimal polynomial of 
a over F, where a is the leading coefficient of f(x). Thus, if /’(x) # 0, 
degree of f’(x) 2 degree of a~'f(x), acontradiction. Hence, f(x) =0. O 


3.5 Corollary. Any irreducible polynomial f(x) over a field of charac- 
teristicO has simple roots. Also any irreducible polynomial f(x) over a field 
F of characteristic p #0 has multiple roots if and only if there exists 
g(x) € F[x] such that 


I(x) = 8(x?). 


Proof. Let f(x) = 279a,x' be an irreducible polynomial over a field F. By 
Corollary 3.4. f(x) has multiple roots if and only if /(x)= 
2%, ia,x'~' = 0. This implies ia, = 0, 1 s i sn. Ina field of characteris- 
tic 0 this gives a, = 0, | Si sn. But then f(x) = ay € F, acontradiction. 
Thus, f(x) has all roots simple if F is of characteristic zero. In case F is of 
characteristic p # 0, and if a, # 0, we must have pji. This implies f(x) 
2a,x', where either a; =0 or pji. But then f(x) = g(x?) for a suitable 
polynomial g(x) € F[x]. O 


The following is an interesting fact regarding the multiplicity of roots of 
an irreducible polynomial over any field. 


3.6 Theorem. Jff(x) © F[x] is irreducible over F, then all roots of f(x) 
have the same multiplicity. 


Proof. Let F be the algebraic closure of F, and let aand f be roots of f(x) in 
F with multiplicities k and k’, respectively. We know that 


F(a) = F[x)/(/(x)) = F(B) 
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by 


Agta,at --: +a,a"—anota,x 
+e) $a,x"+ (f(x) ay t+a,Bt+ ++ +4,f". 


Let us denote this isomorphism of F(a) to F(Z) by oa. Clearly, o(a) = £. 
Consider the diagram 


° a 
F ee F(B) 


F(a) ———_——» Fi) 
We know a can be extended to an isomorphism o* from F to Fi) = F. 
Then o* induces a ring homomorphism 


n: F [x] > F(x] 
given by 
Nag ta,xt °°: $a,x)=a%(ao) + a%(a,)xt+ +: ta%(a,)x’. 
We note that n(/(x)) = /(x). Because n(x — a = (x — B)*, we get that 


(x — 8} is a factor of f(x), so k’ = k. The roles of a and f can be inter- 
changed to show that k2=k’. Hence, kK =k’. O 


3.7 Corollary. If f(x) © F[x] is irreducible over F, then f(x)= 
allv_,(x — a,)*, where a, are the roots of f(x) in its splitting field over F, 
and k is the multiplicity of each root. 


Proof. Obvious from Theorem 3.6. O 


3.8 Example 


Let K = F(x) be the field of rational functions in one variable x over a field 
F of characteristic 3. (Indeed, F(x) is the field of fractions of the polyno- 
mial ring F[x].) Then the polynomial y? — x in the polynomial ring K[y] 
over K ts irreducible over K and has multiple roots. 


Solution. If y3 — x has a root in K, then there exists g(x)/h(x) in K with 
h(x) * 0 such that (g(x)/A(x) = x; that is, g°(x) = xh3(x). But this im- 
plies that 3 (degree of h(x)) + 1 = 3 (degree of g(x)), which is impossible. 
Thus, y? — x € K[y] is irreducible over K. Now if 8, and £, are two roots 
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of y? — x in its splitting field, then 8? = x = £3. But then (8, — £,)? = 
B} + (— 1)383 = 0, and, hence, 8, — 8, = 0. This shows that y? — x has 
only one distinct root whose multiplicity is 3. This completes the solution. 


In the next section we show that an irreducible polynomial over a finite 
field has only simple roots. Hence, it will follow then that the only fields 
over which an irreducible polynomial may have multiple roots are infinite 
fields of characteristic p ¥ 0. 


Problems 

1. Verify that (f(x) + 9(x))’ =f'(x) + 8’(x). 

2. Let F be a field and a,h © F. Show that for any polynomial 
S(x)eF [x] of degree n, 

7 | 

flat y= > BSMan 
where f(x) = (f®- (x) and f(x) = f’(x), k = 2,3,.... (fa) 
is called the kth derivative of f(x) at x = a.) 

3. Show that f(x) € F[x] has a root @ of multiplicity n > 1 if and 
only if f(a) = 0, k = 1,...,2 — 1, and f(a) # 0, where f(a) is 
the ith derivative of f(x) at x = a as defined in Problem 2. 

4. Letf(x) bea polynomial of degree 7 over a field F of characteristic 
p. Suppose f’(x) = 0. Show that p|7 and that f(x) has at most n/p 
distinct roots. 


4 Finite fields 
Definition. A field is called prime if it has no proper subfield. 


Clearly, every field F contains a prime field — namely, the intersection 
of the family of its subfields, called the prime field of F. 

We remark that Q and Z/( p), p prime, are prime fields. The following 
theorem shows that these are the only two kinds of prime fields. 


4.1 Theorem. The prime field of a field F is either isomorphic to Q or 
to Z/(p), p prime. 


Proof. Consider the mapping f: Z —> F given by f(n) = ne, ethe unity of F. 
It is easily checked that fis a homomorphism. 

Case 1. Ker f= (0) (or, equivalently, char F is 0). Then / is an embed- 
ding of Z into F. This embedding can be extended to an embedding / *: 
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Q — F by defining f*(m/n) = me/ne. Thus, Q embeds in F, and, hence, 
the prime field of F is isomorphic to Q. 

Case 2. Ker f # 0. Because Z is a PID, Ker f= (m), ma positive integer. 
By the fundamental theorem of homomorphisms of rings Z/(m) = Im f. 
This shows that Z/(m), being isomorphic to a subring of the field F, has no 
proper divisors of zero, so m must be a prime number p. Thus, Z/(p) 
embeds in F. Hence, the prime field of F is isomorphic to Z/(p). O 


4.2 Theorem. Let F be a finite field. Then 


(i) The characteristic of F is a prime number p and F contains a 


subfield F, = Z/(p). 
(ii) The number of elements of F is p" for some positive integer n. 


Proof. The proof of (i) follows from Theorem 4.1. 

To prove (ii), we regard F as a vector space over its prime field F,. Let 
(e,,...,€,) be a basis of F over F,. Then any element x € F can be wnitten 
uniquely as 

X=a,e, + +++ +4,@,, a,€ F,, i = 1,2,...,n. 


The number of elements of F is thus p", since each a, in the previous 
expression for x can be chosen in p ways, p being the number of elements 
of F,. O 


A finite field is also called a Galois field. A Galois field with p” elements 
is usually written as GF(p"). 

We next prove that finite fields are splitting fields of suitable polyno- 
mials over F,, the subfield with p elements. 


4.3 Theorem. Any finite field F with p" elements is the splitting field of 
xP" — x € Fx]. Consequently, any two finite fields with p" elements are 
isomorphic. 


Proof. In the finite field F with p” elements the nonzero elements form a 
multiplicative group of order p" — 1. Thus, if 0 #/ € F, then A?"—' = 1, 
so JP" = J, Also, if A = 0, then J?" = J. Hence, all the p” elements of F 
satisfy the equation x“ — x = 0. Because x™ — x € F,[x] has only p” 
roots, it follows that F coincides with the set of roots of x?" — x. 

Let E and F be two finite fields with p” elements. By Theorem 4.2, E 
and F contain subfields E, and F,, each of p elements. Also, E and F are 
splitting fields of x" — x over E, and F,, respectively. But since E, = F,, 
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it follows by uniqueness of splitting fields (up to isomorphism) that E = F. 
This proves the theorem. O 


The next theorem shows that there exists a field with p” elements for 
any prime p and positive integer n. As a matter of notation, we denote 


Z/(p) by Z,. 


4.4 Theorem. For each prime p and each positive integer n= | the 
roots of x” — x € Z,[x] in its splitting field over Z, are all distinct and 
form a field F with p" elements. Also, F is the splitting field of x®" — x over 
Zp: 


Proof. Let f(x) = x?" — x. Because f’(x) = p"x”"—' — 1 #0, by Theorem 
3.3, f(x) cannot have multiple roots. Thus, /(x) has all its p” roots distinct. 
We show that these roots form a field that is the splitting field of f(x) over 
Z,. Let a and £ be roots, where £ is different from zero. Then 


(at BP’ =-a" + P=—at BP. 

(aB-')P* = cP"( BP)! = af. 
Thus, the set of roots forms a subfield of the splitting field and, therefore, 
coincides with it. O 


We now show that any finite field has an extension of any given finite 
degree. 


4.5 Theorem. If F is a finite field with p" elements and m is a positive 
integer, then there exists an extension field E of F such that [E: F] = m, 
and all such extensions are isomorphic. 


Proof. Let F be the algebraic closure of F. Consider the polynomial 
f(x) = xe" —xE F(x]. If O# UE F, then wu*-'=1, because the 
multiplicative group of F is of order p” — 1. In addition, because n|mn, 
(p"” — 1|p™" — 1), which gives u?""' = 1; thatis, vu?" = u. This shows that 
each element of F satisfies /(x). 

By Theorem 4.5 the p™ roots of {(x) are distinct and form a field E. 
Thus, we have the tower 
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Next we prove an important result regarding the multiplicative group 
of nonzero elements of a finite field. 


4.6 Theorem. The multiplicative group of nonzero elements ofa finite 
field is cyclic. 


Proof. We cite a result in group theory: Let a and b be elements of a finite 
abelian group G of orders m and n, respectively. Then there exists an 
element c € G whose order is the I.c.m. of mand n(Problem 14, Section 3, 
Chapter 4). 

Let F* be the multiplicative group of nonzero elements of F. By an 
application of the result cited, we can find an element a € F* whose order 
ris the l.c.m. of the orders of all the elements of F*. Then the order of each 
element of F* divides r. Hence, for alla € F*, a’ = 1. Because the polyno- 
mial x’ — | has at most r roots in F, it follows that the number of elements 
in F* s r. However, 1,ay,....@’—! are all distinct and belong to F*. Thus, 
F* is generated bya. O 


As an immediate consequence we have 


4.7 Corollary. Let E be a finite extension of a finite field F. Then 
E = F(a) for some aé E. 


Proof. Let the multiplicative group E* of nonzero elements of E be gener- 
ated by a. Then the smallest subfield F(a) of E containing F and a is E 
itself. O 


Corollary 4.7 gives an important result contained in 


4.8 Theorem. Let F be a finite field. Then there exists an irreducible 
polynomial of any given degree n over F. 


Proof. Let E be an extension of F of degree nm (Theorem 4.5). Then 
E = F(a) by Corollary 4.7. Because E is a finite extension of F, a € E is 
algebraic over F. Let p(x) be the minimal polynomial of a over F. Then 
[F (a): F] = degree of p(x). But since F(a) = E and [E: F] = n, we have 
an irreducible polynomial p(x) of degree n over F. O 


4.9 Examples 


(a) Show that a finite field F of p” elements has exactly one subfield with 
p™ elements for each divisor m of n. 
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Solution. We quote a result in group theory: A cyclic group of order hasa 
unique subgroup of order d for each divisor d of 7. 

Now F* = F — {0} is a cyclic group of order p" — 1. Because mln, 
p™ — 1 divides p” — 1. Thus, there exists a unique subgroup H of F* of 
order p™ — 1. So for all x € H, x°"—' = 1. Hence, x°” = x forallx€ HU 
{0}. Because the roots of x?" = x form a field, H U (0) is our required 
subfield of the given field. 


(b) If f(x) € F{x] is an irreducible polynomial over a finite field F, then 
all the roots of f(x) are distinct. 


Solution. Let F be a finite field with p” elements. By Corollary 3.5, f(x) has 
multiple roots if and only if f(x) = 27,a,(x°’)’. Because a, € F, af? = a,. 
Set b, = a?*"'. Thus, f(x) has multiple roots if and only if 


foy= Fb, = (Fo.x')p, 
i=0 i=0 
a contradiction, because (x) is irreducible. Thus, f(x) must have distinct 
roots. 
The next problem is a converse of Theorem 4.6. 
(c) If the multiplicative group F* of nonzero elements of a field F is 
cyclic, then F is finite. 


Solution. Let F* = (a), where a generates F*. If F* is finite, then F is 
finite, and we are done. So assume F* is an infinite cyclic group. 

Case |. The characteristic of F is p > 0. In this case F = F(a), where 
F, is the subfield (0,1,2,...,p — 1} of F. Consider 1 + a. If 1 + a = 0, then 
at =x |, a contradiction, because F® is infinite. If 1 + a #0, then 1 + 
aé& F*,so 1 +a@=a’, where r is some positive or negative integer. In 
either case 1 + a = a’ yields a polynomial over F, with a@as its root. Thus, 
a is algebraic over F,, so{F,(a): F,,] = degree of the minimal polynomial 
ofa over F, = r, say. Then F = F,(a) has p’ elements, a contradiction. So 
either the characteristic of F is 0, or F* must be finite. 

Case 2. The characteristic of Fis 0. HereO # 1 € F.So— 1 = a’, where 
ris some positive or negative integer. This implies a?’ = 1; that is, o(a) is 
finite, a contradiction. Hence, F* must be finite, so F must be a finite 
field. 


(d) The group of automorphisms of a field F with p* elements is cyclic 
of order n and generated by ¢, where $(x) = x?, x € F. (@ is called the 
Frobenius endomorphism.) 
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Solution. Let F bea field with p” elements. Let Aut (F’) denote the group of 
automorphisms of F. Clearly, the mapping @: F — F, defined by #(x) = 
x?, is a homomorphism. Let x? = jy”. Then (x — y)? = 0. Thus, x = y. 
This shows that @ is 1-1 and, hence, onto. Thus, @ € Aut(F). We note 
that @” = identity because @”"(x) = x?’ = x for all x € F. Let d be the 
order of @. We have @4(x) = x for all x € F. Hence, each x € Fis a root 
of the equation /°’— = 0. This equation has p? roots. It follows that 
d =n, whence d = n. Why? 

Let a be a generator of the multiplicative cyclic group F*. Then F = 
F(a), where F, is the subfield of F with p elements. Let f(x) be the 
minimal polynomial of a over F,. Clearly, the degree of f(x) = n. We are 
interested in counting the number of extensions of the identity mapping 
A: F,— F to an automorphism 4*: F — F. This will then give us all the 
automorphisms of F, because, clearly, any automorphism of F keeps each 
element of F, fixed. 

By Lemma 4.2 of Chapter 15 it follows that the number of automor- 
phisms of F is equal to the distinct roots of f(x). However, by Example (b), 
J(x) has all its roots distinct. Thus, the order of the group Aut(F) is n. 

We showed in the beginning that there exists an element @ € Aut(F) 
such that the order of ¢ is n. Hence, Aut(F) is a cyclic group generated 
by @. 


Problems 


1. IfF isa finite field of characteristic p, show that each element a of 
F has a unique pth root V@ in F. 

2. Construct fields with 4, 8, 9, and 16 elements. 

3. Find generators for the multiplicative groups of fields with 8, 13, 
and 17 elements. 

4. Find generators for the group of automorphisms of fields with 4, 
8, 9, and 16 elements. 

5. Let F bea field with four elements. Find irreducible polynomials 
over F of degrees 2, 3, and 4. 

6. If F isa field and f: F — F is a mapping defined by 


S(xy)=x7', x #0, 


f(x) =0, x=0, 
show that fis an automorphism of F if and only if F has at most 
four elements. 


7. Prove that in any finite field any element can be written as the 
sum of two squares. 
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8. If F is a finite field, then HU (0) is a subfield of F for each 
subgroup H of the multiplicative group F* if and only if |F'*| is 
either | or a prime of the form 2” — 1, where 7 is a positive 
integer. 

9. Show that x? —x-—1 is irreducible over Z,. Let K be the 
splitting field of degree p over Z,. Show K = Z,(w), where w 
is a root of x’? — x — 1. For p= 2,3, and 5, show that the order 
of w in the group K — {0} is 1+ p+p?+---+p?"'. 

10. Without actually computing, find the number of irreducible 
polynomials of (i) degree 2 and (ii) degree 3 over each of the 
fields Z, and Zs. 


§ Separable extensions 


Definition. An irreducible polynomial f(x) € F[x] is called a separable 
polynomial ifall its roots are simple. Any polynomial f(x) € F [x] is called 
separable if all its irreducible factors are separable. 

A polynomial that is not separable is called inseparable. 


Definition. Let E be an extension of a field F. An element a é E that is 
algebraic over F is called separable over F ifits minimal polynomial over F 
is separable. 

An algebraic extension E of a field F is called a separable extension if 
each element of E is separable over F. 


§.1 Remarks. (a) It follows by Corollary 3.5 that any polynomial over 
a field of characteristic zero is separable. Thus, if F is a field of characteris- 
tic 0, then any algebraic extension of F is separable. 

(b) By Example 4.9(b), irreducible polynomials over finite fields have 
distinct roots. Hence, any algebraic extension of a finite field is separable. 

(c) It was shown in Example 3.8 that if K = F(x) is the field of rational 
functions over a field F of characteristic 3, then the polynomial y* — x € 
K[y] is irreducible over K. Also, y? — x has all its roots equal, each being 
a, say. Hence, K(q) is not a separable extension of K. 


Definition. A field F is called perfect if each of its algebraic extensions is 
separable. 


Examples of perfect fields are fields of characteristic zero and finite 
fields. 
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We remarked that infinite fields of characteristic p > 0 have insepara- 
ble extensions. Thus, such fields are not, in general, perfect. 


Definition. An extension E of a field F is called a simple extension if 
E = F(a) for some aé E. 


§.2 Theorem. /fE is a finite separable extension of a field F, then E is 
a simple extension of F. 


Proof. If F isa finite field, then by Corollary 4.7, each finite extension E of 
F is simple. So suppose now that F is infinite. Because E is a finite 
extension of F, E = F(a,,...,a,), where a,€ E, | sis a, are algebraic 
over F. We first show that if E = F(a,f), then there exists an element 
6 € E such that E = F(@).Then the result will follow by induction. Let 
p(x) and q(x) be the minimal polynomials for a and £, respectively, over 
F. Let the roots of p(x) be a= a,,...,a,, and let those of g(x) be B= 
B, ...-»8 » Because E is a separable extension of F, alla,, 1 Ss isn, and all 
B,, 1s js m, are distinct. Because F is infinite, there exists a € F such 
that a #(a,—a)/(B—8,) for |sisn, 25 js m. Then a(f— B,) ¥ 
a,—a.SoaBp+a*a;+ af; forj # 1. Set6= af +a. Then 0 — af, * 
a, for all i= 1,...,n and j = 2,....m. Define A(x) = p(@ — ax) € F(8)[x]. 
Then h( 8) = p(a) = Oand h(B,) = p(6 — aB,) * 0 forj # 1.So Pisa root 
of h(x), but no 8, (j # 1) isa root of h(x). Also, B is a root of q(x). Regard 
q(x) € F(8)[{x]. Let A(x) € F(8)[x] be the minimal polynomial! of £ over 
F(@). Then A(x)|A(x) and A(x)|g(x). Then any root of A(x) is a root of q(x) 
as well as a root of /(x). But the only common root of g(x) and A(x) is £. 
Therefore, A(x) = x — 8. This implies that 8 € F(6). Then since 0 = 
aB +a, a€ F(6). Hence, F(a,f) = F(@). O 


The next theorem gives a necessary and sufficient condition for a finite 
extension to be a simple extension. 


§.3 Theorem. Let E be a finite extension of a field F. Then the follow- 
ing are equivalent. 
(a) E= F(a) forsomea€e E. 
(b) There are only a finite number of intermediate fields between F 
and E. 


Proof. (a) => (b) Let f(x) € F[x] be the minimal polynomial of @ over F. 
Let K be a subfield of E containing F, and let g(x) be the minimal 
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polynomial of a over K. Then since g(x) is in K[x], and f(a) = 0, 
g(x)L f(x). If K’ is the subfield of K containing F and the coefficients of the 
polynomial g(x), then g(x) € K’[x], being irreducible over K, is also 
irreducible over K’. Also, F(a) = E implies K(a) = K’(a) = E. Thus, 
[E: K ] = degree of g(x) = [E: K’]. Hence, K = K’. 

Consider the mapping o from the family of intermediate fields to the 
divisors of f(x) in E[x], given by o(K) = g(x), the minimal polynomial of 
a over K. By what has gone before, a is 1-1. Because there are only finitely 
many divisors of f(x), the family of intermediate fields between F and E is 
also finite. 

(b) = (a) If F is a finite field, then E is a finite field, and the result 
follows from Corollary 4.7. So assume F is infinite. We first prove that for 
any two elements af € E there is an element y € E such that F(a,f) = 
F(y). For each a € F consider the linear combination y, = a + af of a 
and £. The fields F(y,) are intermediate fields between F and E. Because 
there are only a finite number of intermediate fields, there exist a,b € 
F,a#b, such that F(y,) = F(y,). But then y,,y,€F(y,) implies y, — 
y,€ F(y,). Thus, (a— b)BE F(y,), and, hence, BE F(y,). Then 
y, =a+ DBE F(y,) implies a € F(y,). Therefore, F(a,f) C F(y,). Be- 
cause F(y,) C F(a,f), our assertion is proved. 

We now choose u € E such that [F(u): F] is as large as possible. Then 
we claim E = F(u). Otherwise let x € E, x ¢ F(u). We can find an ele- 
ment ¢ € E such that F(¢) contains both u and x, with F(t) > F(u). This 
contradicts the choice of u. Hence, E = F(u). O 


5.4 Examples 


(a) Let E be an extension of a field F, and let a € E be algebraic over F. 
Then @ is separable over F iff F(a) is a separable extension of F. 


Solution. Let B € F(a). We show that £ is separable over F. We have 
F C F(B) C F(a). Let L be an algebraically closed field, and leto: F—> L 
be an embedding. Suppose p,(x) is the minimal polynomial of 8 over F 
that has m distinct roots. Then by Lemma 4.2 of Chapter 15, there are m 
distinct extensions, say 0,,...,0,,, Of o to F(). 

Further, let p,(x) be the minimal polynomial of a over F(8), and 
suppose p,(x) has n distinct roots. Then again by the same lemma, for 
each o,, | S is m, there are exactly n extensions o,,, | Sj Sn, to F(a). 

It is clear that the set of mn embeddings (o,,), 1 s is m, 1 Sj Sn, are 
the only possible embeddings from F(a) to L that extend o: F +L. 
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Now let p,(x) be the minimal polynomial of @ over F. Then 


[F(a): F] = degree p,(x) 
= number of distinct roots of p,(x), since a 
is separable over F (1) 
= number of extensions of the embedding ¢ 


to F(a). 


Moreover, a is separable over F implies a is separable over F(), and, 
hence, by the same reasoning as in the previous paragraph, 


[F(a): F(B)] = degree p,(x) 
= number of distinct roots of p.(x) (2) 
== number of extensions of each a; to F(a) 


= 7, 
Also, 
[F(8): F] = degree p,(x); 
and the number of distinct roots of p,(x) 


= the number of extensions of ¢ to F(£) 

=. (3) 
From (1)-(3), 

mn = [F(a): F) = (F(a): F(B)[F(B): F) 
= the degree of p,(x). 
Hence, m = degree p,(x) = the number of distinct roots of p,(x). Thus, 
p,(x) is a separable polynomial. Hence, f is separable over F. 
The converse is clear. 


(b) Let FC EC K be three fields such that E is a finite separable 
extension of F, and K isa finite separable extension of E. Then K isa finite 
separable extension of F. 


Solution. From Theorem 5.2 we know that E = F(a), K = E(B) for some 
ace E, BE K. Let y€ F(a,B), y € F(a). Then we have the diagram 


F(a, y) 


F(a) 


we 
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such that F(a) is a finite separable extension of F, and y is a separable 
element over F(a). We prove that y is separable over F. Let 


p,(x) = the minimal polynomial of a over F with degree m, 
P2(x) = the minimal polynomial of y over F(a) with degree n, 
p;(x) = the minimal polynomial of y over F with degree s, 
p,(x) = the minimal polynomial of a over F(y) with degree 1. 


Let o: F —> L bean embedding of F into an algebracially closed field L. 

Because a is separable over F, there are exactly m extensions (q,), 
1 sis m,ofoto F(a)(Lemma4.2, Chapter 15). Also, since y is separable 
over F(a), again by Lemma 4.2 of Chapter 15, there are exactly 7 exten- 
sions of each a, to F(a,a). Let us call these n extensions ¢,,,...,0,,, where 
lsism. 

Therefore, there are precisely mn extensions of ao: F—L to 
0,:F(ay)7~Lisism, lsjsn. 

By considering extensions of 0:F — L to F(a,y) via F(y), we obtain 
similarly that there are precisely st extensions to F(a,y). Hence, mn = st. 

Suppose y is not separable over F. Then the number of extensions of 0 
to F(y) is < s (Lemma 4.2, Chapter 15). This implies that the number of 
extensions of a to F(a,y) is < st = mn, acontradiction. Hence, y is separa- 
ble over F. 

(c) If K is a field of characteristic p * 0, then K is perfect if and only if 
K? = K (i.e., if and only if every element of K has pth root in K). 


Solution. Suppose K is perfect. Let a be any element of K. We claim that 
there is an element 5 in K such that a = b?. We must show that the 
polynomial /(x) = x? — a has a root in K. Let 5 be a root of f(x) in some 
extension field of K. Because K is perfect, bis separable over K = K(a) = 
K(b?). Let p(x) be the minimal polynomial for 5 over K. Because bisa root 
of x? — b? € K[x], p(x) is a factor of x? — 5° in K[x]. In K[x] we have the 
decomposition x? — b? = (x — b)?. So p(x) is a power of x — 5. But 5 is 
separable over K, so p(x) has no multiple roots. Hence, p(x) = x — 5. 
Because p(x) € K[x], it follows that 5 € K. 

Conversely, suppose that every element of K is the pth power of an 
element of K. To show that K is perfect, we show that every irreducible 
polynomial of K[x] has distinct roots. Let p(x) € K[x] be irreducible. 
Assume, for contradiction, that the roots of p(x) are not distinct. Then by 
Corollary 3.5, p(x) has the form ay + a,x? + a,x? + °*+ +4,x”™, 
where @o,...,4, © K. By hypothesis there exist elements 5p,...,5, € K such 
that a,= b? (i =0,]1,...,”). Then since K has characteristic p, p(x) = 
(by + b,x + °*> + b,x"), which is a contradiction. So every irreducible 
polynomial of K[x] has distinct roots. Hence, K is perfect. 
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Problems 


f&wn- 


AN 


Prove Q(v2,V¥3) = Q(v2 + V3). 

Find 6 # V3 + V5 such that Q(v3,V5) = Q(6). 

Find @ such that Q(V2,w) = Q(0), where w? = 1,w # I. 

Let F = Z/(p), and let E = F(x) be the field of rational functions, 
where p is prime. Let K = F(x”). Show that K is not perfect. 
(Consider y? — x” € K[y].) 

Prove that a finite extension of a finite field is separable. 
Prove that every extension of Q is separable. 

Let a be a root of x? —x—1 over a field F of characteristic 
p. Show that F(a) is a separable extension of F. 


CHAPTER 17 


Galois theory 


In this chapter we deal with the central results of Galois theory. The 
fundamental theorem on Galois theory establishes a one-to-one corre- 
spondence between the set of subfields of E, where E isa splitting field ofa 
separable polynomial in F [x], and the set of subgroups of the group of 
F-automorphisms of E£. This correspondence transforms certain prob- 
lems about subfields of fields into more amenable problems about sub- 
groups of groups. Among the applications, this serves as the basis of 
Galois’s criterion for solvability of an equation by radicals, as discussed in 
the next chapter, and provides a simple algebraic proof of the fundamen- 
tal theorem of algebra. 


1] Automorphism groups and fixed fields 


Let E be an extension of a field F. We denote by G(E/F) the group of 
automorphisms of E leaving each element of F fixed. The group G(E/F) is 
also called the group of F-automorphisms of E. Throughout this section 
we confine ourselves to finite separable extensions and their groups of 
automorphisms. We recall that a finite separable extension E of F is 
simple. Thus, E = F(a) for some a & E. Let p(x) be the minimal polyno- 
mial of a over F. Then [F(a): F] = degree of p(x) =n, say. Also by 
Lemma 4.2, Chapter 15, we get that the order of the group G(E/F) iss n. 
Thus, we have 


1.1 Theorem. If E is a finite extension of a field F, then\G(E/F)|s 
[E: F]. 
322 
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1.2 Examples 


(a) Consider G = G(C/R). Ifa + ib € C, a,b € R, and oc € G, then o(a + 
ib) = o(a) + o(i)o(b) = a+ oa(i)b. Also —1 =o0(—1) = afi?) = (a(t). 
Thus, o(i) = ti. Hence, o(a + ib) = a+ ib. Thus, there are only two 
possible R-automorphisms of C. Hence, |G| = 2. In this case [C:R] = 2 
also 


(b) Consider G = G(Q(¥3)/Q). x3 — 2 is the minimal polynomial for 
¥2 over Q, so 92) :Q] = 3 and Q(¥2) has a basis (1,¥2,V4) over Q. Let 
a: Q(V2) —> Q(V2) be an automorphism such that o(a) = a for alla € Q. 


Then 
a(a + bV2 + cV4) = o(a) + a(b)a(V2) + a(c)o(V4) 
=q+ bo(3/2) + co(3/4). 
Also 


(o(¥2))? = o((V2)?) = 0(2) = 2, 
SO a(V2) is a cube root of 2. Therefore, o(V2) = ¥2, ¥2w, or ¥2w?, where 
w? = 1, w * 1. Because o(¥2) is real, the only possibility is 0(¥2) = V2. 
Hence a is the identity. Thus, G(Q(¥2)/Q) is the trivial group. 


Definition. Let E be any field, and let H be a subgroup of the group of 
automorphisms of E. Then the set E,, = (x € Ejo(x) = xforalla © H} is 
called the fixed field of the group of automorphisms H. 


It is easy to verify that E,, is a subfield of E. If E is an extension of a field 
Fand H < G(E/F) then FC E,,C E. For example, C,;, = R, where H = 
G(C/R), and Q(¥2) = Q(¥2), where H = G(Q(¥2)/Q). 

One of the main results in this section is 


13 Theorem. Let H be a finite subgroup of the group of automor- 
phisms of a field E. Then 


[E: Ey) =|). 


Before we prove this theorem we prove a lemma due to Dedekind. 


1.4 Lemma (Dedekind). Let F and E be fields, and let 6, ,03,....0, be 
distinct embeddings of F into E. Suppose that, for a, ,dz,....a,€ E, 
D7, a,0,(a) = Oforalla € F. Then a,= 0 forall i = 1,2,...,n. (This is also 
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expressed by saying that distinct embeddings of F into E are linearly 
independent over E.) 


Proof. Suppose there exist elements a,,...,a, € E, not all zero, such that 
a,0,(x) + +++ +4,0,(x) = 0 (1) 


for all x € F. Ofall the equations of the form (1) there must be at least one 
for which the number of nonzero terms is least. Let 


b,0,(x) + -** +5,,0,,(x) = 0, (2) 
for all x € F, be such a relation, where 5, € E, i = 1,...,m, and no b, = 0. 
Clearly, m> 1. 


Because o, ¥ a,, there exists y € F such that o,(y) # ¢,,(y). Replace x 
by yx in (2) and obtain 


b,o,(yx) + +++ + b,06,( yx) = 0 
for all x € F. But this implies 
b,o,(y)o,(x) + °+ + bp On(Y)Gm(x) = 0 (3) 
for all x € F. If we multiply (2) by o,(y) and subtract from (3), we get 
b(5,(y) — 0,(y))02(x) + 2°* + by (O(Y) — On Y)Om(X) = 0. 


The coefficient of a,,(x) 1s 6,,(0,(y) — 0,,(y)) # 0. So we have an equation 
of the form (1) with fewer than m terms, a contradiction in view of the 
choice of relation (2). This proves the lemma. O 


Proof of Theorem 1.3. Let H = (e = g,,...,8,), and let [E: E,] = m. If 
possible, suppose m <n. Let (a; ,...,@,,} be a basis for E over E,,. Consider 
the system of m homogeneous linear equations 


8,(a))x, + ++ + 8,(a)x, = 0, 


j= 1,2,...,.m, in n unknowns X,,...,X,. Because n > m, this system has a 
nontrivial solution. So there exist y,,...,y, € E, not all zero, such that 


8:(a))y, + «°° +8,(a)y, = 0 (1) 
j= 1,...,m. Let a be any element of E. Then 
a= 040, + *** + Om, 


where @,,...,@,, © Ey. So 
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m 
a.ai)y te Se Se (3 2,41) 
1 


im] 


m 


Bi(a)y, + --: +2,(a)y, = ai( 


{= 


re > a8(a)y, + + > A282 A)Vn 
im] im 


= yy a(g(a)y, + °°: + 8,(a)yn) = 9, 
i=] 
by (1). 


Hence, the distinct embeddings g,,...,g, are linearly dependent, which 
contradicts the lemma. Therefore, m = 7. 

Suppose, if possible, m > n. Then there exists a set of n + 1 elements of 
E that are linearly independent over E,,; let such a set be (a,,...,4,41). 
Consider the family of n homogeneous linear equations 


B(a,)x, ye ee + 8(an4; nti 0, l= 1,...57, 


inn + 1 unknowns x,.,...,.X,4;-. 90 there exist )',,...,.¥,4, © &, not all zero, 
such that 


BA), Hots + BlAns i) nas = 0, (2) 


j = I,...,2. Choose ),,...,¥,4, 80 that as few as possible are nonzero, and 
renumber so that 


y,#0, i=1,...,7 and Vr, =O= os = Vag, 


Equation (2) now becomes 


&a,)y, + -** + 2e,(a,)y, = 0. (3) 
Let g © H and operate on (3) with g. This gives a system of equations 

88(a,)8(V;) + +++ + 88(a,g(y,) = 0, (4) 
which 1s clearly equivalent to the system of equations 

&fa,)g(y,) + ++ + Bfa,)g(y,) = 0. (4) 


If we multiply relations (3) by g(y,), and (4’) by y,, and subtract, we 
obtain 


&(A2K 28 (Y;) — B(V2)Mi) + os + Ba MK y,8(1) — B(y)y) = 9, (5) 


j = I,,...,2. (5) 1s a system of equations like (3) but with fewer terms, which 
gives a contradiction unless all the coefficients y,2(y,) — y,g@(y,) are equal 
to zero. If this happens, then y,y;'! = g(y,y7;') for all g€ H. Thus, 
yiyy'€ Ey. Therefore, there exist z,,....z,€ E, such that y,= y,z,, 
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i= |,...,7. Then relation (3) with j = 1 becomes 
8:(4,) 12, + °°* +28,(a,)y,z, = 0. 
This implies 
8,(a,)z, + -*+ + 8,(a,)z, = 0, 
because y, * 0. But then, because z, € E,,, i = 1,...,7, we get 
8,(4,2,) + +++ + 8,(a,z,) = 0, 


QZ, + ++: +a,z,=0, 


because g, is a nonzero embedding. But the linear independence of 
a,,...,a, over E,, yields that z, = 0 = --- =z,. This implies y, ~0= 
**+ = y , a contradiction. Hence, [E: E,,] =n =|H|, as desired. O 


As an immediate consequence of Theorems 1.1 and 1.3, we have 


1.5 Theorem. Let E be a finite separable extension ofa field F, and let 
H < G(E/F). Then G(E/E,) = H, and [E: Ey) =|G(E/E,)\. 


Proof. Clearly, H < G(E/E,,). By Theorem 1.3, |H|= [E: E,]. Also by 
Theorem 1.1 we have 
|| = [E: Ey] =|G(E/E,,)| = |H|. 


Hence, H = G(E/E,) and [E: Ey] =|G(E/E,,)|. This completes the 
proof. O 


Another important consequence of Theorem !.3 is the following char- 
acterization of normal separable extensions. 


1.6 Theorem. Let E be a finite separable extension of a field F. Then 
the following are equivalent: 


(i) Eisanormal extension of F. 
(iti) F is the fixed field of G(E/F). 
(iii) [E:F]) =|G(E/F)\. 


Proof. Because E is a finite separable extension of F, E = F(a) for some 
a € E(Theorem 5.2, Chapter 16). Let p(x) be the minimal polynomial of 
a over F, and let its degree be 1. Then 
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[E: F) = (F(a): F] =n. (1) 
Also, if E, is the fixed field of G(E/F), then, by Theorem 1.3, 
[E: Eo) = |G(E/F)|. (2) 


(i) => (ii) By Lemma 4.2 in Chapter 15 the number of extensions of the 
inclusion mapping F — F to the embedding F(a) — F is equal to the 
number of distinct roots of p(x). Because E is a separable extension of F, 
a € Eisaseparable element, so its minimal polynomial p(x) over F must 
have distinct roots. So the number of distinct roots of p(x) is equal to 7. 

In addition, because E = F(a) is a normal extension of F, any embed- 
ding o: F(a) — F shall map F(a) onto F(a). Clearly, any member of 
G(E/F) is an extension of the inclusion mapping F — F. Thus, it follows 
that 


\G(E/F)| = number of distinct roots of p(x) = n. (3) 
Thus, (1)-(3) give 

[E:F] =n =|G(E/F)| = (E: Ep]. 
Hence, [E,: F] = 1, so Ey = F as desired. This proves (i) = (11). 

(11) => (1) As explained earlier, E= F(a) for some a@EE. Let 
G(E/F) = (o, = identity, 0,...,.0,). Consider the polynomial /(x) = 
(x — 0,(a@)X{x — 6,(a)) ++: (x — 0,(a)). Now each a, € G(E/F) induces a 
natural homomorphism 

of: E[x] —@ E[x], 
where 
OT (Ay + A,X + *°* +.,,X™) = O(a) + O,(4,)x + °°* +0,(4,,)x™. 
So 
of (f(x) = (x — (4,0, Ka) Kx — (0,02) *** (x — (6,0, K)). 
But since 06,0,,0;0,....0,6, are distinct members of G(E/F), these F- 


automorphisms are only a permutation of 4;,02,...,0,. Hence, 
of (f(x)) = f(x) for all i = 1,2,...,.n. Now by expanding /(x) we have 


SI) PR OT xt 3 + oe + (Ye, 
where c; € E. Therefore of(/(x)) = f(x) implies 
G,(C)) = ¢, for all i, j = 1,2,...,7. 


This gives that c, is in the fixed field of G(E/F). So, by hypothesis, c, € F, 
j = 1,,...,m. Hence, f(x) € F[x]. Also, all the roots of f(x) lie in E. Because 
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E = F(a) and a is one of the roots of {(x), E is a splitting field of f(x) € 
F [x]. This proves (ii) => (i). 

(ii) => (111) Follows from Theorem 1.5. 

(iii) => (ii) By equation (2) [E: Ey] =|G(E/F)|. Thus, by (iii), we get 
[E: E,] = (E: F). Hence, E, = F, proving (iii) => (11). This completes the 
proof of the theorem. O 


1.7 Examples 


(a) The group G(Q(a)/Q), where a’ = | and a * 1, is isomorphic to the 
cyclic group of order 4. 


Solution. Clearly, a — 1 =(a— 11 + a+ a? + a + a), soaisa root 
of a polynomial p(x) = 1 +x +x? + x3 + x* E Q[x]. Because p(x) is 
irreducible over Q, [Q(a):Q] = 4. Also, the roots of x°— 1 are 
1 ,a,a?,03,a*. So Q(a) is the splitting field of x5 — 1 € Q[x] and, hence, a 
normal extension of Q. Thus, 


IG(Q(a)/Q)| = [(Q(a):Q] = 4. 
Because { | ,a,a?,a°) is a basis of Q(a) over Q, a typical element of Q(q) is 
Ataataet+aj, a,EQ. 
The four Q-automorphisms of Q(q) are indeed the following: 


O12 Ay + a, + a0? + as03 > dy + a, + az,0? + aa’, 
Oy: Ay + A, + az? + aya? > dy + a,a? + az,a4 + aya® 
= a, + a,a? + a,a* + aa, 
3: Ay + A, + az,002? + as03 > dy + a0 + a,0° + aya? 
= Qa, +a,a> + a,a + a,c, 
O42 Ag + A, + a0? + a,03 > dy + 04 + a,0° + aya? 
= a, + a,a* + a,03 + a3. 


These form a cyclic group of order 4 generated by a, or 03. 


(b) Let E= Q(V2,), where w? = 1, w * 1. Let a, be the identity 
automorphism of E, and let a, be an automorphism of E such that 
o,(w) = w? and a,(2) = o(V2). If G = {o,,0,) then Eg = Q(V2w?). 


Solution. Now a & Eg if and only if o,(a) = a. Consider the following 
diagram of towers of fields: 
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Q7/72, w) 
basis (1, w) 
Q 72) 
basis (1, 0/2, 3/4) 


Thus, (1,¥2,¥4,w,w¥V2,wV4) is a basis of Q(¥2,w) over Q. So let 
a=atbV¥2+cV4 + dw + ew V2 + fw V4 


be an element in Q(V2,w). Then 
0,(a)=at bV2w + cw2V4 + dw? + ew*wi2 + fw*wV4 
=a+ bwi2 + c(—1 — w)V4 + d(—1 — w) + e¥2 + fwV4 
=(a—d)+e2—- V4 — dw + bw¥2 — (c— f)wV4. 
So o,(a)™=a implies a=a-—d, b=e, c=—c, d=—d, e=b, and 
f=-—c+t/f. So we obtain d=0=c, b=e, and f, a arbitrary. Thus, 
a= at bV2 (1+ aw) + fwV4 
= a— bw?V2 + fw4 
=a — bw?W2 + f(w? 2). 
Therefore, Eg = Q(w?¥?2). 


Problems 


1. Let E=Q(¥2,w) be an extension of a field Q, where w? = 1, 
w 1. For each of the following subgroups 5S, of the group 
G(E/Q) find Es. 

V2 > V2w?, 

wr w?, 

V2 -» V2, 

wr w?, 

V2 V2, 


wW— w?, 


(a) 5S, =(1,0,), where o,: 
(b) S> = (1,03), where 03. 


(c) S;=(1,0,}, where o,: 


(d) S,=(1,05,0,), where 
be —» V2, {2 —» 202, 
0s: and 40,: 
ww, ww. 
2. Let E be a splitting field of x* — 2 € Q[x] over Q. Show that 
G(E/Q) is isomorphic to the group of symmetries of a square. 
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Find subgroups of order 4 and their fixed fields. (Hint: E = 
Q(92,i). Ifo is the automorphism of E given by o(V2) = iV2 and 
o(i) = i, then o generates a subgroup of order 4. Also, if t is the 
automorphism given by 7(V2) = V2 and 1(i) = —i, then we can 
describe all elements of G(E/Q) in terms of o and t by means of 
the relation o* = 1 = t?, to = o°t. Also, there are three sub- 
groups of order 4: 


C,:{1,0,07, 07}, 

C41: {1,07, t, 07}, 

C42: {1,07, 01, 0°t}. 

Q(i), Q(¥2), Q(i¥2) are respectively the fixed fields of C,, C,,, 


a2] 


2 Fundamental theorem of Galois theory 


We are now ready to prove the fundamental theorem of Galois theory. 
First we give a couple of definitions. 


Definition. Let f(x) € F[x] be a polynomial, and let K be its splitting field 
over F. Then the group G(K/F) of F-automorphisms of K is called the 
Galois group of f(x) over F. 


Definition. A finite, normal, and separable extension E of a field F is called 
a Galois extension of F. 


For example, if f(x) € F[x] is a polynomial over a field F of character- 
istic zero, then its splitting field E over F is a Galois extension of F. 


2.1 Theorem (fundamental theorem of Galois theory). Let E bea 
Galois extension of F. Let K be any subfield of E containing F. Then the 
mapping K -> G(E/K) sets up a one-to-one correspondence from the set of 
subfields of E containing F to the subgroups of G(E/F) such that 


(ii) For any subgroup H of G(E/F), H = G(E/Ey,). 
(iti) (E:K]=|G(E/K)|, (K:F] = index of G(E/K) in G(E/F). 
(iv) K is anormal extension of F if and only if G(E/K) is a normal 
subgroup of G(E/F). 
(v) IfKis anormal extension of F, then G(K/F) = G(E/F)/G(E/K). 
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Proof. By definition of normality, it follows that £ is a normal extension of 
K. Thus, K 1s the fixed field of G(E/K). This proves (i). 

The proof of (ii) follows from Theorem 1.5. Note that the proof needs 
only the fact that E is a finite separable extension of F. 

Because E is a normal extension of F and also of K, we have, by 


Theorem 1.6, 

[E:F]=|G(E/F)| and [E:K] =|G(E/K)|. 
Thus, 

[E:F]) = (E: K)[K:F] 
gives 

|G(E/F)| = |G(E/K)\[K: F]. 


This proves [K: F] = index of G(E/K) in G(E/F), as desired. 

Now we proceed to prove (iv). Let F be an algebraic closure of F 
containing E. Recall that K is a normal extension of F if and only if each 
embedding a: K — F, which keeps each element of F fixed, maps K onto 
K (Theorem 2.1, Chapter 16). 

We assert that K is a normal extension of F if and only if for each 
o € G(E/F), o(K) = K. If K is a normal extension of F and o € G(E/F), 
then o restricted to K is an embedding of K into E and, hence, into F. 
Thus, by Theorem 2.1 of Chapter 16, o(K) = K. Conversely, let 0: K — F 
be an embedding that keeps each element of F fixed. By Theorem 4.3, in 
Chapter 15, o can be extended to o*: E — F. But then o*(£) = E, be- 
cause E 1s a normal extension of F (Theorem 2.1, Chapter 16). Thus, 
o* € G(E/F). So, by hypothesis, o*(K) = K. Hence o(K) = K because o* 
is an extension of o. Thus, we have shown that if 0: K — F is an embedding 
that keeps each element of F fixed, then o(K)= XK. Therefore, by 
Theorem 2.1 of Chapter 16, K is a normal extension of F. This proves our 
assertion. 

Therefore, K is a normal extension of F if and only if for allo € G(E/F) 
and k € K, o(k) € K. Then for all t € G(E/K), t(o(k)) = o(k). This im- 
plies (o~'to)(k) = k for all kK € K. Hence, o~'to € G(E/K). This proves 
G(E/K) is a normal subgroup of G(£/F). Retracing our steps, we see 
clearly that 1f G(E/K) is a normal subgroup of G(E/F), then t(o(k)) = 
o(k) for all t € G(E/K), for all o € G(E/F), and for all ke K. We do 
know E is a normal extension of K. Hence, K is the fixed field of G(E/K). 
Thus, t(o(k)) = o(k) implies o(k) € K. Therefore, K isa normal extension 
of F, as desired. 

Finally, we prove (v). Let K be a normal extension of F. By the preced- 
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ing discussion, for all o € G(E/F), o(K) = K. Thus, o induces an auto- 
morphism o* of K defined by 0*(k) = o(k), k € K. Clearly, o* € G(K/F). 
Consider the mapping /: G(E/F) — G(K/F) defined by f(a) = o*. Let 
0; ,0, © G(E/F). Then 

(afozKk) = of(aX(k)) = of(0,(k)) = (0,02). 


Therefore, (o,0,)* = ofo}. Thus, fis a homomorphism of G(E/F) into 
G(K/F). Now Ker f= (ao € G(E/F)\o* = identity). But o* = identity if 
and only if o*(k) =k for all k € K. That is, o(k) = k for all k € K, so 
o © G(E/K). Hence, Ker f= G(E/K). Then by the fundamental theorem 
of homomorphisms 
G(E/F) 
G(E/K) 


Further, by (iii), we have 
eee 
G(E/K) 

Also, because K is normal over F, 
|G(K/F)| = [K: F]. (3) 


Hence, by (1)-—(3), we get 


G(E/F) 
G(E/K) 


This completes the proof of the theorem. O 


= Im fC G(K/F). (1) 


=(K:F). (2) 


= G(K/F). 


2.2 Examples 


(a) If f(x) € F[x] has r distinct roots in its splitting field E over F, then the 
Galois group G(E/F) of f(x) is a subgroup of the symmetric group S,. 


Solution. Let f(x) = @y +a,x+ +++ +a,x" € F[x] have r distinct roots 
Q, ,Qz,...,a,. Clearly, for each o € G(E/F), o(a,) is again a root of f(x). 
Also, if a, * a,, then o(a,) * o(a,). Thus, o(@,),0(a.),...,0(&,) 1S a permu- 
tation of a, ,...,a,. Let us set 6,(a,) = o(a,), i = 1,...,7. Then @, € S,, and 
we have a mapping /: G(E/F) — S, given by /(a) = ¢,. If 6,4 € G(E/F), 
then 


Pol) = (aKa) = O(n(a,)) = Do, (O) 
for all a,. Hence, ¢,, = $,9,. This shows that fis a homomorphism. 
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Toshow fis 1-1, let @, = identity. Then @,(a;,) = a;; that is, o(a,) = a; 
for all i = 1,...,7. Thus, o = identity, since E = F(q,,...,a,). Hence, fis an 
embedding of G(E/F) into S,. 


(b) Let F be field of characteristic # 2. Let x? — a € F[x] be an irre- 
ducible polynomial over F. Then its Galois group is of order 2. 


Solution. Clearly, if a is one root of x? — a, then —a is the other root. So 
a # —a because char F # 2. Thus, x? — a is separable over F. The split- 
ting field F(a) of x? — aover Fisa finite, separable, and normal extension 
of degree 2 over F. Thus, |G(F(a)/F)| = 2. 


(c) Let F be a field of characteristic # 2 or 3. Let f(x) =x? + bx +c 
be a separable polynomial over F. If f(x) is irreducible over F, then the 
Galois group of f(x) is of order 3 or 6. Also, the Galois group of f(x) is S; if 
and only if A = — 45? — 27c? is not a square in F; that is, there does not 
exist any element a € F such that a? = A. 


Solution. if f(x) has a root a € F, then f(x) = (x — @)g(x), where g(x) € 
F [x]. In case g(x) has a root in F, then f(x) splits into linear factors in 
F itself, so the Galois group of f(x) is of order |. In case g(x) is irreduc- 
ible over F, the splitting field E of f(x) over F is an extension of degree 
2, so |G(E/F)|= [E: F] = 2. Thus, if f(x) is irreducible over F, then 
|G(E/F)| ¥ 1 or 2. 

Because f(x) is also separable, all its roots are distinct. Hence, by 
Example (a), G(E/F) < S;, where E is splitting field of f(x) over F. Let 
G = G(E/F). Then |G|= 3 or 6, for otherwise f(x) is reducible, as dis- 
cussed earlier. 

Now we find necessary and sufficient conditions on the coefficients of 
J(x) such that G is isomorphic to S,. Ifa, ,a@,, and a, are the distinct roots 
of f(x), we let db = (a, — XQ — &,Ka, — a,) and A = 6?. 

Then for all o € G, o(6) = + 0. Hence, o(A) = A. Thus, A isin the fixed 
field of G; thatis, A € F. Nowifdé € F, then o(d) = 6 forallo € G. Thus, a 
cannot be an odd permutation. Hence, o € A;, the subgroup of even 
permutations of S,. Conversely, if ao € A;, then o(d) = 6. Hence, G = A, 
if and only if 6 € F. Thus, G = S, if and only if 6 ¢ F; that is, the polyno- 
mial x? — A € F[x] is irreducible over F (since A = 6? € F). By actual 
computation we have 


A = —4b} — 27¢?, 


Thus, the Galois group of f(x) is S, if and only if —46> — 27c? is not a 
square in F. 
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(d) The Galois group of x* — 2 € Q[x] is the octic group (= group of 
symmetries of a square). 


Solution. x4 — 2 is irreducible over Q, and x* — 2 = (x — V2Xx + V2) 
(x + iV2\x — i¥2) € Q(V2,i)[x]. Therefore, E = Q(V2,i) is the splitting 
field of x* — 2 over Q. By Example 1.3(d) in Chapter 16, [E:Q] = 8. Also, 
E is a normal separable extension of Q. This gives|G(E/Q)| = [E:Q] = 8. 
If co € G(E/Q) and BE E, then 


B= a, +a,V2 + a(v2) + a,(V¥2)3 + a,(i) + a,(i¥2) 
+ a,(i(V¥2)) + a,(i(V2)). 
So we have 
o( B) = ay + a,0(V2) + a,0(V2) + aso(V2)° 
+ a,0(i) + a,o(i)o(V2) 
+ a,a(i)o(¥2P + a,0(i)o(V2/. 

Therefore, o is determined by its effect on i and on V2. Because o(i) 
must be i or —i; because o(V¥2) must be V2, —¥/2, i¥2, or —i¥2; and 
because there are eight elements in G(E/Q), it follows that G(E/Q) = 
(0, ,0z,...,0g}, where 


Yio) In -¥f8 YP? 


O1: 
Ri, iri, iri, 
oe 4/2 — 4/2, 4/2-+i4/2 : 2+» —i4/2 
Ts . ° C2. : j O04. . . 
i- —I, bt 1, i> 1, 
xe Vint 4, Vn a 
i> —i, ir» —i, 
Let o, = ¢/2, a, = i6/2, a3 = — 4/2, a, = — 14/2. 
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Then the elements of G(E/Q) permute the roots a, ,a@2,a@;, and a, of 
x‘ — 2 as follows (see the diagram): 


o,: O° rotation; as: reflection about d,; 
a,: 90° rotation; — 4,: reflection about /,; 
a;: 180° rotation; g,: reflection about d,; 
0,: 270° rotation; a: reflection about /,. 


(e) Illustrate the 1-1 correspondence between subgroups and subfields 
by using the Galois group of x‘ — 2 € Q[x]. 


Solution. We saw in the solution to Example (d) that E = Q(V2,/) is the 
splitting field of x* — 2 over Q and that the Galois group G of x‘ — 2 isthe 
octic group. In the solution of that problem, o, is the identity automor- 
phism e. Set o, = co; then 0; = a”, 0, = a, and o* = «. Next set 0; = 7; 
then t? = €, 6, = oT, 0, = o7t, and 0, = o°t. Finally, we have to = o°t. 

The group G has four normal subgroups other than itself and 
the identity subgroup: N, = {€,01,07,0°t}, N, = {€,0,07,0°}, N3= 
(€,t,07,07t), and N, = (€,o7), and four nonnormal subgroups: H, = 
(e,0°t), H, = (€,ot), H, = (€,07t), and H, = (€,t). The inclusion relations 
between these subgroups are shown in the following diagram: 


G 


H, \ $ it, 
{e) 


We now describe the fixed fields under these subgroups. We begin with 
Ey,, the fixed field of the subgroup N,. Let a = 4/2 and let 


X= a, + a,a + a,0? + a0 + asi + Agia + azia? + agia? 
be any element of Q(a,/). Then x & Ey, iff x = 07(x). Because o(a) = ia 
and o(i) = i, we have 
o(x) = a, + aia — aya? — ayia? + asi — aga — azia? + aa? 
= A, — Ach — A,X? + agar? + asi + a,ia — a,ia? — a,ia’, 
a(x) = a, — a,a + a,0? — aa 
+ Asi — Agia + a,ia? — asia’. 
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Hence, x = o°(x) if and only if a. =—a@,, 4 = —Q4, Ag ™ —Qg, Ag™ 
—a,. SO a, = a,=a,= a, =0, and a@,,a3,a,, and a, are arbitrary. 
Therefore, 

X= a, + a,0? + asi + aia. 
Thus, Ey, = Q(v2,/). 

Next, to find Ey,, we note that x € Ey, iff x = t(x) = o%(x) = o?1(x). 

Let 

X= A, + a.a + azar? + a,02 + asi + Agia + azia? + azia’?. 
Because x = o°(x) implies a, = a, = @, = a, = 0, we have 

X =a, + a,0? + asi + azia?. 
Further, t(a) = @ and t(i) = —i. So 

1(X) = a, + a3? — asi — ayia’. 
Therefore, x = t(x) implies a, = 0 = a,, yielding 

X = (x) =a, + azar’. 
Applying o? we obtain 

o*t(x) = a, + axa’, 
since o(a) = a. This proves x € Ey, iffx = a, + aya’, where a, and a, are 
arbitrary. Thus, Ey, = Q(v2). Similarly, Ey, = Q(i¥2) and Ey, = Q(i). 
Ey, En,, En,, and Ey, are normal extensions of Q - being splitting fields 
of polynomials (x? + 1x? — 2), (x? — 2), (x? + 1), and (x? + 2), respec- 
tively. 

Similar computations show that E,,, = Q((1 — /)a@),£y,, = Q((1 + a), 

Ey, = Q(ia), and E;,, = Q(a). 


(f) The Galois group of x3 — 2 € Q[x] is the group of symmetries of the 
triangle. 


Solution. The roots of equation x? — 2 = Oare ; 2, w¥2, and w?V2, where 
w is a root of the irreducible polynomial x? + x + 1 over Q(V2). Thus, we 
write 

x3 —2 = (x — Y2Kx — wV2\(x — w?42), 


whence E = Q(V2,w). It follows that [E:Q] = 6. There are then exactly 
six automorphisms of E, for E is a splitting field and therefore normal. 
These automorphisms are determined by the manner in which they 
transform the roots of the above equation. The root ¥2 can have only 
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three images, w only two. There are six possible combinations, and be- 
cause there are six automorphisms, all combinations occur. The auto- 
morphism group is given by the table 


if o o? t OT ot 


2 w2 w2V2 2 w2 w?)2 


w Ww Ww w w? w? w? 


(g) The Galois group of x‘ + 1 € Q[x] is the Klein four-group. 


Solution. Let E = Q(a), where a = e*/*, Then because the roots of x* + | 
are a,a3,a5, and a’, E is the splitting field of x* + 1 over Q. Because 
x4 + 1 is irreducible over Q, [E:Q] = 4. Also the characteristic of Q is 
zero. So E is a normal separable extension of Q. Thus, |G(E/Q)| = 
[E:Q] = 4. Ifo € G(E/Q) and BE E, then 8 = ay + a, + a0? + a30 
and o( £) = ay + a,0(a) + a,0(ay + a,o(a). Hence, a is determined by 
its effect on a. Because o(a) must be a root of x* + 1, and because there 
are four elements in G(E/Q), it follows that G(E/Q) = (0, ,03,05,07}, 
where o,(a) =a, 0;(a~) = a3, 0,(~) = a°, and o,(a~) = a’.. Note that 
0;(0;(a)) = a° =a, 0,(0;(a)) = a =a, and o0,(a)) =a’ =a; so 
o% = go, = identity, 03 = o,, and o3 = a,. Therefore, G(E/Q) is the Klein 
four-group (every element except the identity has order 2) = C, X C). 


(h) Let 1 be a positive integer, and let F be a field containing all the mth 
roots of unity. Let K be the splitting field of x” — a € F[x] over F. Then 
K = F(a), where a is any root of x” — a, and the Galois group G(K/F) is 
abelian. 


Solution. lf w = cos(22/n) + isin (22/n) and a is any root of x” — a, then 
a,0W,...,a@"~' are all the roots of x” — a. Thus, the splitting field K of 
x" — aover Fis F(a). To show G(K/F) is abelian, we let 0, ,0, € G(K/F). 
Because a 1s root of x" — a, o,(a) and o,(q) are also roots of x” — a. So we 
may now write 


o,(a)=aw' and 0,(a) = aw/ 
forO <i, js n—1. Then 
(0,0,XQ) = 0,(0,(a)) = 0,(aw’) = o,(a)w! = aw'*!, 


Similarly, (0,0,{a) = aw), Hence, 0,0, = 0,0,, because K = F(a). 
Thus, G(K/F) is abelian. 
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Problems 
1. Find the Galois groups G(K/Q) of the following extensions K 
of Q: 


(a) K=Q(v3,¥5). 
(b) K=Q(q), where a = cos(22/3) + i sin(27/3). 
(c) XK 41s the splitting field of x* — 3x? + 4 E Q[x]. 

2. In Examples 2.2(/) and 2.2(g) find the subgroups of G(E/Q) and 
the corresponding fixed fields. 

3. Let ueR and let Q(u) be a normal extension of Q such that 
[Q(u):Q] = 2”, where m > 0. Show that there exist intermediate 
fields K, such that 


Ky=QCK,CK,C-:-CK,, = Q(u), 
where [K;: K,-,] = 2. 


3 Fundamental] theorem of algebra 


In this section we given an application of the fundamental theorem of 
Galois theory to prove that the field of complex numbers is algebraically 
closed. 


3.1 Theorem (fundamental theorem of algebra). Every polynomial 
S(x) € C[x] factors into linear factors in C[x]. 


Proof. Let f(x) = @g ta,x+ +++ +a,x" & C[x]. Let 
B(x) = (x2 + 1 dy + ayxt 20* +a,x"KAy + a,x+ +** +4,x"). 


Then g(x) € R[x]. Let E be the splitting field of (x) over R. Then 
RCCCE. We prove E=C. 

First, we assert that there does not exist a subfield K of E containing C 
such that [K:C] = 2. Suppose such a subfield exists. Because K is a finite 
separable extension of C, K = C(qa) for some aE K, If p(x) € C[x] is 
the minimal polynomial of @ over C, then deg p(x) = 2. Suppose 
p(x) = x? + 2ax + b. Then 


p(x) = (x + a? — (a2 — b) = (x + a — Va? — bxx + a + Va? — b), 


which is a contradiction, because p(x) ts irreducible over C (observe that if 
c EC, then vc € C). This proves our assertion. 

Let G = G(E/R) be the Galois group of g(x) over R. Let |G| = 2g, 
where g is an odd integer. Let H be the 2-Sylow subgroup of G, and let L be 
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the corresponding subfield of E. We exhibit the foregoing considerations 
in the following diagram: 

Ece—,- (e} 

L-——H 

C+—— G(E/C) 

R -——— G(E/R). 


Then [E:L] = 2”, so [L:R] = gq. Also because L is a finite separable 
extension of R, L = R() for some 8 € L. Thus, the minimal polynomial 


Q(x) = bg + Ox + --* + OxVE R[x] 


of 8 is of odd degree g. But this is impossible unless g = 1, because by the 
intermediate value theorem in calculus every equation of odd degree over 
the reals has a real root. Hence, g = 1, so [E:R] = 2”. This implies that 
[E:C] = 2™—'. Then the subgroup G(E/C) of G(E/R) is of order 2”—'. If 
m> 1, let H’ be the subgroup of G(E/C) of order 2-2, and let L’ be 
the corresponding subfield of E. Clearly, L’ > C. Then [E:L’] = 2-2, 
so [L’:C] = 2, a contradiction of the assertion proved in the begin- 
ning. Hence, m= 1. Then [E:R] =2 with EDC implies E =C, as 
claimed. O 


CHAPTER 18 


Applications of Galois theory to classical 
problems 


1 Roots of unity and cyclotomic polynomials 


Definition. Let E be a field, and let n be a positive integer. An element 
w € E is called a primitive nth root of unity in E ifw" = 1, butw™ * | for 
any positive integer m <n. 


Note that the complex numbers satisfying x" = | form a finite sub- 
group H of the multiplicative group of the nonzero elements of the field C 
of complex numbers. Also H is a cyclic group generated by any primitive 
nth root w of unity. There are exactly (7) primitive mth roots of unity for 
each positive integer 7. These are cos(2kz)/n + i sin(2kz)/n, where k is a 
positive integer less than nm and prime to n. 


1.1 Theorem. Let F be a field, and let U be a finite subgroup of the 
multiplicative group F* = F — {0}. Then U is cyclic. 
In particular, the roots of x" — 1 € F[x] form a cyclic group. 


Proof. By Lemma 3.3 of Chapter 8, U = S(p,) X -:+ X S(p,), where 
|S(p,)| = pi, and p,,...,p, are distinct primes. We proceed to show that 
each S(p,) is cyclic. Let a € S(p,) be such that o(a) is maximal, say p,’'. 
Because 0(a)|p,", we have s,s 7,. Further, for each x € S(p,), o(x) = 
pi s pi. Therefore, x*/ = 1 implies x7 = 1. Hence, for all x € S(p,), 
x?" = |, Because the equation x” = 1 has at most p? roots, it follows that 
pi = p;'. But this yields s,=7,, since p/' = p}'. Therefore, o(a) = pi' = 
|S(p,)|, and, hence, S(p,) is a cyclic group generated by a. We also know 
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that if A and B are cyclic groups of orders m and n, respectively, with 
(m,n) = 1, then A X B is again cyclic [Example 4.5(a), Chapter 4]. 
Hence, U is cyclic. O 


1.2 Theorem. Let F be a field and let n be a positive integer. Then 
there exists a primitive nth root of unity in some extension E of F if and 
only if either char F =0 or char F In. 


Proof. Let f(x) = x" — 1€ F[x], and let char F = O or char F Yn. Then 
Sf’ (x) = nx"! #0. Thus, f(x) has n distinct roots (in its splitting field E 
over F), and they clearly form a group, say H. So by Theorem 1.2 this 
group H, consisting of the n distinct roots of x” — 1, isacyclic group. Now 
if w € H is a generator of H, then w" = 1, but w” * | for any positive 
integer m <n. Hence, w is a primitive mth root of unity in an extension 
field E of F. 

Conversely, let w be a primitive mth root of unity in some extension 
field E of F. Then 1,w, w?,...,@"—' are clearly n distinct roots of f(x) 
x"— 1. So f(x) does not possess multiple roots. But then /’(x) = nx"—! # 
0. This implies either char F =O orcharF fn. O 


Definition. Let n be a positive integer, and let F be a field of characteristic 
zero or characteristic pn. Then the polynomial ®,(x) = I1,,(x — @), 
where the product runs over all the primitive nth roots w of unity (i.e., the 
primitive nth root of x"—1 over F) is called the nth cyclotomic poly- 


nomial. 


For example, ®,(x)=x-—1, ®(x)=x+1, Ox) =x?+x+4 1, 
®,(x) = x72 +1, Ox) = x4 + x8 + x7 +x4+ 1, O (x) = x2? — x 4+ 1. 


1.3 Theorem. ®,(x) = I1,,(x — w), @ primitive nth root in C, is an 
irreducible polynomial of degree d(n) in Z[x]. 


Proof. Let E be the splitting field of x” — 1 € Q[x]. Because the character- 
istic of Q is zero, E is a finite, separable, and normal extension of Q. SoQ 
is the fixed field of G(E/Q). Because for any primitive nth root w of unity 
and any o € G(E/Q), o(w) is again a primitive mth root of unity, the 
induced mapping o*: E[x] —> E[x] keeps ®,(x) unaltered. Thus, the 
coefficients of ®, (x) lie in the fixed field of G(E/Q)); that is, ®,(x) € Q[x]. 

But then the facts that ®,(x) is a factor of x" — 1 and ®,(x) is monic give 
®, (x) € Z[x]. Because the number of primitive mth roots of unity is 6”), 
this also shows that ®,(x) is of degree $(n). 
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We now show that ®,(x) is irreducible over Z. Let f(x) € Z[x] be an 
irreducible factor of ®,(x), and let w be a root of f(x). Of course, w is a 
primitive mth root of unity. We shall prove that if p is a prime such that p 
does not divide n, then w? is also a root of f(x). First note that w? is alsoa 
primitive mth root of unity. This follows from the fact that w? is also a 
generator of the cyclic group consisting of the roots of x" — 1. 

Because f(x) € Z[x] is a factor of ®,(x), there exists A(x) € Z[x] such 
that 


®,,(x) = f(x)A(x). 


So if w? is not a root of f(x), it must be a root of A(x). Thus, @ is a root of 
h(x”). So f(x) and A(x?) have a common factor over some extension of Q. 
But this implies f(x) and A(x?) have a common factor over Q. (To prove 
this, use Euclid’s division algorithm.) Because f(x) is irreducible over Z 
and also over Q, we get {(x) divides A(x”). Write h(x?) = f(x)g(x). Be- 
cause f(x) and h(x?) are monic polynomials over Z, g(x) is also a monic 
polynomial over Z. Let us denote by /(x) and A(x), respectively, the 
polynomials f(x) (mod p) and A(x) (mod p) - that is, polynomials ob- 
tained from f(x) and A(x) by replacing their coefficients a € Z with a € 
Z/(p). Because a? = a (mod p) for all integers a, we get 
h(x?) = (h(x))?, 

so h(x?) = f(x)g(x) gives that h(x) and f(x) have a common factor. Thus, 
from ®,(x) = f(x)h(x) and ®,(x)|(x" — 1), we get that x” — 1 has multi- 
ple roots. But this is impossible. For if a is a multiple root, then the 
derivative of x"— 1 should vanish at x = a; that is, nma"~' = 0. This 
implies a"~! = 0, because the characteristic p does not divide n; soa = 0. 
Because a = 0 is not a root of x" — 1, we get a contradiction. Hence, we 
have shown that if w is a root of f(x), then w? is also a root of f(x). Because 
any primitive mth root of unity can be obtained by raising w to a succes- 
sion of prime powers, with primes not dividing 7, this implies that all 
primitive mth roots of unity are roots of f(x). Hence, f(x) = ®,(x). Thus, 
®, (x) is irreducible over Z. O 


1.4 Theorem. Let w be a primitive nth root of unity in C. Then Qw) 
is the splitting field of ®, (x) and also of x"—1€Q[x]. Further, 
[Q(w):Q] = A(n) = |G(Q(w)/Q)| and G(Q(w)/Q) = (Z/(n))*, the mul- 
tiplicative group formed by the units of Z/(n). 


Proof. The minimal polynomial of w is ®,(x) by Theorem 1.3, and be- 
cause Q(w) contains a primitive mth root of unity, it contains all mth roots 
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of unity; therefore, Q(w) is the splitting field of D, (x) and of x" — 1. Also, 
[Q(w): Q] = degree of ©, (x) = d(7). Because Q(w) is a finite, separable, 
and normal extension of Q, 

IG(Q(@)/Q)| = [Q(@): Q]. 


If o € G(Q(w)/Q)), then o(q@) is also a primitive nth root of unity; so 
o(w) = w’, where v < nand(v,n) = 1. Denote a by o,. We know there are 
dn) such v’s, and they are precisely the members of the group (Z/(”))*. 
Let 


Sf: (Zf(n))y* — G(Q(@)/Q) 


be a mapping defined by /(v) = o,. Then it is easy to see that f is both a 
1-1 and an onto mapping. To check that fis a homomorphism, let v,, 
v, € (Z/(n))*. Write v,v, = gn + rwith r < n. By definition of multiplica- 
tion in (Z/(n))*, v,v. = r. Now 


MQ" = EMt! = qw’ 
So 

f(¥,v2) = f(r) =0,= Ov.v, = 0,,9., = f(v,)f(v2). 
Hence, (Z/(n))* = G(Q(w)/Q). OG 
Remark. If p is an odd prime, then (Z/( p*))*, the multiplicative group 
formed by the units of the ring Z/(p‘), is cyclic. The proof of this result is 
quite technical. The interested reader may find the proof in any advanced 


text. For p = 2 the following example [Example |.5(a)] shows that the 
result is not necessarily true. 


1.5 Examples 
(a) D(x) and x* — | have the same Galois group, namely, (Z/(8))* = 
(1,3,5,7)}, the Klein four-group. 


Solution. Follows from Theorem 1.4. 


(b) The Galois group of x‘ + x? + | isthe same as that of x° — 1 and is 
of order 2. 


Solution. Note x4 + x? + 1 = z2+2+ 1, where z = x2. But 27+ 2+ 1 
is the minimal polynomial for a primitive third root of unity. So the 
splitting field of x* + x? + 1 will contain the square roots of e27"3 and 
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e*=/3, Thus, we need to adjoin 


er/3 = (e2RH/3) 1/2, e4xt/3 = — (e273) 1/2, e2xt/3 = (e4%43)1/2, 
e5nt/3 = — (e4*43) 1/2, 


So E = Q(a), where a = e*/>, is the splitting field of x4 + x? + 1 € Q[x]. 
Because @ is a primitive sixth root of unity, E is the splitting field of 
x® — |. Then G(E/Q) = (Z/(6))* = (1,5) is the group of order 2. 


2 Cyclic extensions 


Definition. Let E be a Galois extension of F. Then E is called a cyclic 
extension of F if G(E/F) is a cyclic group. 


2.1 Examples of cyclic extensions 


(a) If @ is a primitive pth root of unity and E = Q(qw) is the splitting field of 
x? — | € Q[x], then E is a Galois extension of Q because x? — | is a 
separable polynomial. Also, E is a cyclic extension. 

(b) All finite extensions of finite fields are separable. Thus, the splitting 
field E of a polynomial f(x) over a finite field F is a Galois extension. 
Furthermore, by Example 4.9(d) in Chapter 16, G(E/F) is cyclic. There- 
fore, all splitting fields over finite fields are cyclic extensions. 


The proof of the following useful result is easy and 1s left as an exercise 
for the reader. 


2.2 Proposition. Let F be a field of nonzero characteristic p. Then for 
every positive integer k the mapping 2, of F into itself, defined by n(x) = 
x°* for all elements x of F, is an embedding of F into itself. (The mapping 
1 (x) = x? is called the Frobenius endomorphism.) 


2.3 Lemma. Let E be a finite extension of F. Suppose f: G— E*, 
E* = E — {0), has the property that f(on) = a( f(n))f(o) for all a,n € G. 
Then there exists a€ E* such that f(a) = o(a™')a for alla € G. (The 
mapping f in the hypothesis of the lemma is called a crossed homomor- 
phism.) 


Proof. For all n € G, f(n) € E*, so f(n) # 0. Thus, if 
> A(n)n(b) = 0 
nEG 


for all b € E*, then by the Dedekind lemma (Lemma 1.4, Chapter 17), 
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J(n) = 0, which is not true. Hence, there exists b € E* such that 


aS (n)n(b) = a # 0. 


Then for any o € G, we get 
¥ a(f(n)n(b)) = o(a), 
nEG 


which gives 


D 2(f(n))o(n(b)) = (a). 
7EG 


Thus, by using o( f(7)) = (/(0))~ (on), we have 
2S (a))~ 'f(an)on(b) = o(a). 
He 


But 
{onln € G) = (nln € G). 
Hence, 


(f(a))-! 2 Sinn) = 0(a). 


This implies 
(f(a))~'a = a(a). 
Equivalently, we get 
ao(a—') = f(a). 


This completes the proof. O 


2.4 Lemma (special case of Hilbert’s problem 90). Ler E be a finite 
extension of F, and letG = G(E/F) bea cyclic group of order n generated 
by o. Ifw & E be such that wo(w)a*(w)---0"~ *(w) = 1, then there exists 


a € E* such that w = o0(a)a~'. 
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Proof. By Lemma 2.3 we need only define /: G — E* such that {(a) = w 
for some o € G and fis a crossed homomorphism. Define /: G — E* as 


follows: f(1)=1, f(o)=o, and f(o')=a'" (w)---o(w)w for 2<i< 


n— 1. We now check that fis a crossed homomorphism; that is, f(o7) = 
o(f(n)) f(a). Take o', o/ € G. If i + j = 0 (mod n), then f(a'a/) = f(a") = 


1; also 


a'( f(a) f(a!) = a'(a!" (w) --- o(a)wXa'"(w) + o(wa)en) 


= 0" '(w)o"~ 2(w)---0(w)w = 1, by hypothesis. 
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If i+j is not a multiple of n, then 

S(a'o!) = f(a'*/) = f(a") = o—"(w)a"{w) ++: (wu, 
where i + j = gn + rand r <n. Next consider 


o'f(a/)f(a') = a'(a—"(w) +> o(w)wXa'"(w) °°: o(w)w) 
= ao !*))-"\(@)a"'*)-(q@) eee o(w)w 
= a’ (w)a’-%{w) eee o(w)w. 


Hence, fis a crossed homomorphism. 0 


We may remark that the condition wa(w)o?(w)---a"~ '(w) = 1 is the 
usual condition that the norm of w in E over F is 1, i.e., Ngjp(w) = 1 
(see Problem 4). 


2.5 Theorem. Let F contain a primitive nth root w of unity. Then the 
following are equivalent: 


(i) Ejisa finite cyclic extension of degree n over F. 
(ii) Evisthe splitting field ofan irreducible polynomial x" — b € F[x]. 


Furthermore note E = F(a), where « is a root of x" — b. 
Proof. (i) => (ii) Let a be a generator of the finite cyclic group G(E/F). By 


Lemma 2.4 there exists a € E* such that o(a) = wa. Then it follows 
easily that, for all i= 1,2,..., 

o'(a) = w'a. (1) 
Then for all i= 1,2,..., 

o'(a") = (o(a))" = wa" = a". 
Thus, a" & F, and if b= a", then x" — bE F[x] and x"-5b=T17., 
(x — w'a). 

We show that x" — b € F[x] is irreducible over F. Suppose x" — 5 = 


J(x)g(x), where f(x) is a nonconstant irreducible monic polynomial over 
F. If w‘a is one root of /(x), then for each positive integer j we have 


o/— (wa) = o/-(w')o— (a). 
But w! € F. So o/—“w') = w'. Also, from (1), o/-{a) = w/-'a. Thus, we 
have 

o/-"(w'a) = w'ws—'a = wa. 


Because any F-automorphism maps a root of a polynomial over F onto a 
root of that polynomial, we get that w/a is also a root of f(x). Hence, all the 
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roots of x” — b are roots of f(x). Thus, f(x) = x" — 6b, which proves that 
x" — bis irreducible over F. Note that we have also shown that E = F(a), 
where a is a root of x” — 5, is a splitting field of x" — 6 over F. Thus, 
(i) = (11) and (111). 

We now show (ii) = (i). Let cE E be a root of x” — bE F[x]. So 
b =c". Clearly, then, c,cw,cw?,...,cw"~' are n distinct roots of x” — 6, 
where w € F isa primitive mth root of unity. Thus, x” — b is a separable 
irreducible polynomial. Hence, E = F (c) is a Galois extension of F. For 
each o € G(E/F), let x(a) be defined by 


Xa) = (k E Zla(c) = w*c}. 


Then x(a) ¥ © because a(c) is also a root of x” — 6. Moreover, for any 
k € x(a), x(0) = k € Z/(n), for w*c = w/c if and only if k = j (mod n). 
Further, if o,c € G(E/F) and if o(c) = w*c and t(c) = w/c, then 


(otc) = a(w/c) = w/a(c) = wit*e, 


X(ot) = x(a) + X(t), 

where the sum on the night is interpreted as the binary operation in the 
additive group Z/(n). 

Finally, if y(¢) = 0, then o(c) = c. So o = identity on E because E = 
F (c). Consequently, x is an isomorphism from G(E/F) onto a subgroup 
of the additive group Z/(n). Also, [F(c): F] = degree of the minimal poly- 
nomial of c over F = degree of x” — 5, that is, n. Because E = F(c) is a fi- 
nite, separable, normal extension of F, |G(E/F)| = [E:F] =n. Because 
G(E/F) is isomorphic to a subgroup of Z/(n), it follows that 


G(E/F) = Z/(n). O 


Problems 


1. Ifa field F contains a primitive mth root of unity, then the charac- 
teristic of F is 0 or a prime p that does not divide n. 

2. Let F contain a primitive nth root of unity, and let E be the 
splitting field of x"—b over F, where m|n and m is prime. 
Then either E = F or x"—b is irreducible over F. What can 
you say if m is not prime? 

3. Let pbea prime and let F bea field. Prove that x? — b € F[x] is 
reducible over F iff its splitting field is F or F(w) according to 
whether char F = por char F ¥ p, where w is a primitive pth root 


of unity. 
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4. Let E be a finite separable normal extension over F and let 
G(E/F) = {o, = 1, o3,...,0,}. If a€E we define 


T gp(@) = 2 aa) and N E/F(@) = int o,(a) 

and call these respectively the trace and norm of ain E over F. 

Show: 

(a) T,,-(a)eF, Nz plajeF. 

(b) 7;,, 1s an F-linear map of the vector space E over F. 

(c) Neg is a group homomorphism from the group E* = 
E — {0} to the group F* = F — {0}. 

(d) If G(E/F) is a cyclic group generated by o, then N;,-(a) = 
1 if and only if there exists beE such that a =(o(b))~ 'b. 
(Hint: Generalize Lemma 2.4.) 


3 Polynomials solvable by radicals 


The object of this section is to use Galois theory to find necessary and 
sufficient conditions for a polynomial over a field F to be solvable by 
radicals. We also construct a polynomial of degree 5 that is not solvable by 
radicals. Before we prove these results, we need to explain the terms used. 


Definition. An extension E of a field F is an extension by radicals (or 
radical extension) if there are elements a,,...,a, © E and positive integers 
N, 4.4, Such that E = F(aQ,,...,0,), a € F, anda? € F(ay,...,),-1), 1 < 
isr. 


For example, Q(02) is a radical extension of Q. Also, Q(V2, V3) is a 


radical extension of Q. 
A useful fact about radical extension is 


3.1 Remark. If E, is a radical extension of F = E, with intermediate 
fields E,,...,E,_, (written in ascending order), then there exists a radical 
extension E} of F = E, with intermediate fields £},...,E3_, (written in 
ascending order) such that 
(i) EVDE,. 
(ii) £% is a normal extension of F. 
(iii) E, is a splitting field of a polynomial of the form x” — 5,€ 
E\~\{x], i= 1,...,8. 
To prove this remark, let us assume that we have an ascending chain of 
fields F= E,C E,C ::: CE, starting with F such that E, = E,_,(a,), 
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where a isa root of x" — a; € E,_,[x], i = 1,...,r. Let w bea pnmitive nth 
root of unity, where n = n, «+: n,. Consider the tower 


E,(w) 


E9(w) = F(w) 


Ey =F 
Clearly, E,(w) is a radical extension of F. Also F(w), being a splitting field 
of x" — 1 € F[x], is a normal extension of F. Thus, F is the fixed field of 
G(F(w)/F), so the polynomial 


A) = (x — o(a,)) 
o€G(F(w)/F) 

belongs to F[x]. (Here f(x) = (x"' — a,)*, k =|G(F(w)/F)|, since a, € 
F.) 

Let g,(x) = (x" — 1)f,(x). Then g,(x) € F[x]. Let K be the splitting field 
of g,(x) over F. Then K is a normal extension of F. Clearly, a, € K, 
w € K,and E, C K. It is also clear that there is a finite ascending chain of 
fields between F and K such that each field is a splitting field of a polyno- 
mial of the form x” — b over the preceding field. Next we construct a field 
L such that L contains the fields K and E, and is a normal extension of F. 
So we consider a polynomial g,(x) = g,(x)/,(x), where f(x) = Tleeguyry 
(x: — o(a,)). Because K is anormal extension over F, f(x) € F[x]. Thus, 
g(x) € F[x]. Let L be the splitting field of g.(x) over F. Then L contains 
a, and K, and, hence, E,(a,) = E, C L. Therefore, L is a normal exten- 
sion of F containing E,. Further, in view of the nature of the polynomial 
g,(x), it is clear that there exists a finite ascending sequence of interme- 
diate fields between K and L such that any member of the sequence is a 
splitting field of a polynomial of the form x” — 6 over the preceding 
member. Continuing like this, we can construct a radical extension E} of 
F having the desired properties. This proves the remark. O 


Definition. A polynomial f(x) € F [x] overa field F is said to besolvable by 
radicals if its splitting field E is contained in some radical extension of F. 


We note that a polynomial f(x) € F[x] is solvable by radicals if we can 
obtain every root of f(x) by using a finite sequence of operations of 
addition, subtraction, multiplication, division, and taking nth roots, 
starting with elements of F. 

Throughout this section we assume that all fields are of characteristic 


zero. 
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3.2 Theorem. f(x) € F[x] is solvable by radicals over F if and only if 
its splitting field E over F has solvable Galois group G(E/F). 


Proof. First suppose that G(E/F) is solvable. Because the characteristic of 
F is zero, E is a normal separable extension. So [E: F] =|G(E/F)| = n, 
say. Assume first that F contains a primitive nth root of unity. Then F 
contains primitive mth roots of unity for all positive integers sn that divide 
n. Let G = G(E/F). Because G 1s solvable and finite, there is a chain 
G=G) 2G, > °*: DG,= (e) of subgroups of G such that G; 4 G,_, and 
G,_,/G;, is cyclic. Let F= F, C F, C -*> C F,= E be the corresponding 
subfields of E given by the fundamental theorem. Then E, = F; and 
G(E/F,;)=G;. Also, by the fundamental theorem, G, = G(E/F,) <4 
G(E/F) = G implies F, is a normal extension of F. 


E G(EIF)=G 
Vv 
F, G(E/F,)=G, 
(normal) U 
F G(E/E) = (ec) 


Now E can be regarded as the splitting field of f(x) over F,. So E is a 
finite normal extension of F,. Then G, <1 G, implies that F, is a normal 
extension of F;. 


E G(E/F,)= G, 
V 
Fy G(E/F,)= G, 
(normal) U 
Fy G(E/E) =e 


Continue in this way to show that F; is a normal extension of F;_,. 


E G(E/F; .,)=G,.. 4 
Vv 
F, G(E/F,) = G, 
(normal) U 


Fi_y G(E/E) = (e) 
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Furthermore, G(F,/F;_,) = G(E/F,_,)/G(E/F,) = G,_,/G, by the funda- 
mental theorem. So F; is a cyclic extension of F,..,. Then by Theorem 2.5, 
F, is the splitting field of an irreducible polynomial x” — 6, € F;_ ,[x] and 
F, = F,_,(a,), wherea?' = b, € F,;_,. Then E = F(a,,...,a@,),a7' © F,and 
a” F,_,= F(a,,...,@,_,) for 1 < is r. Thus, f(x) is solvable by radicals 
over F. 

Next we drop the assumption that F contains a primitive mth root of 
unity. The polynomial x" — | € E[x] has roots in E. Let p be a primitive 
nth root of unity lying in E. Then E(p) is the splitting field of f(x) regarded 
as a polynomial over F(p). Any F(p)-automorphism o of E(p) will leave 
the coefficients of the polynomial f(x) unaltered. Now, for any automor- 
phism o € G(E(p)/F(p)), we have 09 = o|, € G(E/F), since E isa normal 
extension of F. Further, the map o> dp is a 1-1 homomorphism of the 
group G(E(p)/F(p)) into G(E/F). Then since a subgroup of a solvable 
group is solvable, G(E(p)/F(p)) is solvable. Now by the first part, E(p) isa 
radical extension of F(p); so E(p) is a radical extension of F. Then the 
splitting field E of f(x) € F [x] is contained in the radical extension E( p) of 
F, so f(x) is solvable by radicals. 


Before we prove the converse, we prove a lemma that deals with a 
particular case. 


3.3 Lemma. Let E be the splitting field of x" —a€ F(x]. Then 
G(E/F) ts a solvable group. 


Proof. lf F contains a primitive mth root of unity, then by Example 2.2(h) 
in Chapter 17 we know that G(E/F) is abelian and, hence, solvable. Now 
suppose that F does not contain a primitive mth root of unity. Let p € F be 
a generator of the cyclic group of the mth roots of unity. Let b be a root of 
x"— a. Then bp is also a root. So p = b— (bp) is in the splitting field E of 
x"— a € F[x]. Consider F C F(p) C E. F(p) is a normal extension of F, 
since F(p) is the splitting field of x” — 1; so G(E/F(p)) is a normal 
subgroup of G(E/F) by the fundamental theorem of Galois theory. 


E G(E/F) 
V 
F(p) G(E/F(p)) 
(normal) U 
F (e) 


But G(E/F(p)) is abelian, because E is the splitting field of x" — a € F [x]. 
So (e) I G(E/F(p)) <1 G(E/F) is a normal series. Again by the funda- 
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mental theorem of Galois theory, G(E/F)/G(E/F(p)) = G(F(p)/F), 
which is abelian (being isomorphic to (Z/())* by Theorem 1.4 because 
F (p) is the splitting field of x" — 1). So G(E/F) has a normal series with 
abelian factors whose last element is the trivial group. Therefore, G(E/F) 
is solvable. O 


We are now ready to complete the proof of the theorem. 


In view of Remark 3.1, if a polynomial f(x) € F[x] is solvable by 
radicals, we may, without any loss of generality, assume that the splitting 
field E of f(x) is contained in a radical extension E, of F such that E, is a 
normal extension of F and there exist intermediate fields E,,...,£,_, such 
that E, is a splitting field of a polynomial of the form x™ — 6, € E,_,[x). 
Thus, by the fundamental theorem of Galois theory 


(e) C G(E,/E,_,) C G(E,/E,-2) C +** C G(E,/F) 
is a normal series. Also 

G(E,/E,—;)/G(E,/E,—i41) = G(E,—i41/E,-1) 
is solvable by Lemma 3.3. Then 

(e) CG(E,/E,_,) C G(E,/E,-2) C «** C G(E,/F) 


is a normal series with solvable quotient groups, so G(E,/F) is solvable. 
Further, since G(E/F) = G(E,/F)/G(E,/E), G(E/F) is a homomorphic 
image of G(E,/F). Hence, it is solvable. O 


Remark. We know that the symmetric group S, is not solvable if n = 5. 
Thus, any polynomial whose Galois group is S,,, 2 = 5, is not solvable by 
radicals. In Problem 1 (a) we give a polynomial of degree 5 whose Galois 
group is S,. This shows that not all polynomials of degree 5 are solvable by 
radicals. 


Historical background 


The result that a general polynomial of degree 5 is not solvable by radicals 
was discovered by Abel in 1824. But the general problem of finding a way 
of deciding whether or not a given polynomial could be solved by radicals 
was not completed by Abel, though he was working on it when he died in 
1829. A young Frenchman, E. Galois, worked on this problem and sub- 
mitted his memoirs to the Academy of Sciences in Paris. But before he 
could get recognition, he died in a duel in 1832. It was Liouville who, 
while addressing the Academy in 1843, announced the profound work of 
Galois regarding the solvability of a polynomial by radicals. 
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Before we give applications of Theorem 3.2, we would like to recall the 
important fact [Example 2.2(a), Chapter 17] that the Galois group of a 
polynomial f(x) € F[x] having r distinct roots is embeddable in the sym- 
metric group S,, the group of all permutations of the r distinct roots 
(C1, ,...,Q,). 

We call a subgroup H of S, a transitive permutation group if, for all 
i,j € (1,2,...,), there exists o © H such that o(!) = /. 

We also recall the following result from group theory. 


3.4 Theorem. Jf p is a prime number and if a subgroup G of S, is a 
transitive group of permutations containing a transposition (a b), then 
G=5,. 


We now prove 


3.5 Theorem. Let f(x) bea polynomial over a field F with no multiple 
roots. Then f(x) is irreducible over F if and only if the Galois group G 
of f(x) is isomorphic to a transitive permutation group. 


Proof. Let a,,...,@, be the roots of f(x) in some splitting field E. Then for 
each gE G, o(a,),...,0(@,) 1S a permutation of a@,,....a,. By Example 
2.2(a) in Chapter 17, we may look upon G as a subgroup of S,. 

First assume f(x) is irreducible over F. Then for each j = 1.,...,7, 


F(a,) = F[x)(f(x)) 


in which a, x + (f(x)), a7 a + (f(x)), a € F. This isomorphism in- 
duces the isomorphism 7: F(a,) — F(a,), where a, a, and a+ a,a& 
F. But since E is a normal extension of F, 7 can be extended to an 
F-homomorphism 7*: E — E. Then 4* € G(E/F) and 7*(a,) = a,. Thus, 
G is a transitive permutation group. 

Conversely, let G be transitive. Let p(x) be the minimal polynomial for 
a, over F. Suppose a, is any root. Because G is transitive, there exists 
ao &€ Gsuch that o(a,) = a,. Then p(a,) = p(o(a,)) = op(a,) = 0. Hence, 
each a, is a root of p(x). Because p(x)|/(x), it follows that f(x) = cp(x), 
c € F. Thus, f(x) is irreducible over F. O 


Next we prove 


3.6 Theorem. Let f(x) € Q[x] be a monic irreducible polynomial over 
Q of degree p, where p is prime. If f(x) has exactly two nonreal roots in C, 
then the Galois group of f(x) is isomorphic to S,. 


354 Applications of Galois theory to classical problems 


Proof. Let E Cc C bea splitting field of f(x) over Q. Then by Theorem 3.5, 
G(E/Q) is isomorphic to a transitive permutation group H, which is a 
subgroup of S,. Let a,,...,@, be roots of f(x), and let a, be its complex root. 
Because f(x) € Q[x], a, is also a root of f(x). Hence, a, = a, for some 
1 sj = p,j * i. Consider the embedding a: z > z from E toQ. Because E 
is anormal extension of Q, o maps E onto E. Thus, o € G(E/Q). Then the 
permutation of the roots a, ,...,a, of f(x) corresponding to the element 0 
of the Galois group G(E/Q) takes a, to a; and a; to a,, and keeps all a, 
(k # i, j) fixed. Hence, by Theorem 3.4, H = S,. Thus, G(E/Q) = S,, as 
required. O 


3.7 Examples 


(a) Show that if an irreducible polynomial p(x) € F[x] overa field Fhasa 
root in a radical extension of F, then p(x) is solvable by radicals over F. 


Solution. Let E, be a radical extension of F. By Remark 3.1 there exists a 
radical extension E; of F such that E; > E, and E; is a normal extension 
of F. Because p(x) is irreducible over F and has a root in E,, it has a 
root in E{. But because E; is a normal extension of F, it follows that 
E’, contains a splitting field of p(x). This shows that p(x) is solvable by 
radicals. 


(b) Show that the polynomial x’ — 10x*° + 15x + 5 is not solvable by 
radicals over Q. 


Solution. By Eisenstein’s criterion f(x)=x’—10x°+15x+5_ is 
irreducible over Q. Further, by Descartes’s rule of signs it is known 
that 


the number of positive real roots 
<= the number of changes in signs in f(x) = 2, 


and 


the number of negative real roots 
<= the number of changes of signs in f(— x) = 3. 


Thus, the total number of real roots = 5. Moreover, by the intermediate 
value theorem there are five real roots, one in each of the intervals 
(—4,—-3), (—2,—1), (—1,0), (1,2), and (3,4). So f(x) has exactly two 
nonreal roots. By Theorem 3.6 the Galois group of f(x) is S,. Hence, by 
Theorem 3.2, f(x) is not solvable by radicals. 
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Problems 


1. Show that the following polynomials are not solvable by radicals 
over Q: 
(a) x°—9x+4+3 (b) 2x> —5x*4+5 
(c) x°—8x +6 (d) x°—4x+2 

2. Let F[{x,,xX2,X;] be a polynomial ring in x, ,.x,,x, over a field F. 
Let K = F(x,,X2,X;) be the field of rational functions (i.e., the 
field of fractions of the ring F[Xx,,x,,x3]). Suppose 


{H=C-—x,P +x At—x,€ K[d. 


Prove that the Galois group of f(t) over K is S,. Generalize this 
result to a polynomial of degree 71 (see Theorem 4.1). 


4 Symmetric functions 


In this section we give an application of Galois theory to the symmetric 
functions. Let F be a field, and let y,,..., y,, be 7 indeterminates. Cossider 
the field of rational functions F(y,,...,y,) over F. Ifo is a permutation of 
{1,...,”) - that is, 0 € S, — then o gives rise to a natural map 


CG: F(y, sy Vo) =e F(), svg De) 
given by 


Oo (2 gooey »)) _ F (Yor) -++» Yorn) 
464 535 Vn) B(Vorty s+++s Von), 
where f(Y4 5---) Vn) BV 9-0 Yn) & FLV 5+ Yn] and 2(Y, 5... Yn) ¥ O. It is im- 


mediate that o is an automorphism of F(),,..., ),,) leaving each element of 
F fixed. 


Definition. An element f(),...-. ¥n)/@QV' 0-00 Ya) OF F(Vy,-- x) is called a 
symmetric function in },,..., y, over F ifit is left fixed by all permutations 
of l,...,n, that is, for alla € S,, 


5 |= | — SV 9 Vn) 
BV Kn BV ys Va) 


Let S, be the group of all F-automorphisms @ of F()),,...,¥,) corre- 
sponding to gE S,. Obviously, S, = S,. Let K be the subfield of 
F(y,,---,¥,) that is the fixed field of S,. Consider the polynomial 


S00 = T] (x—- 9»). 


i=] 
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Now f(x) € F(y,,.--> ¥,)[x]. Clearly, the natural mapping 


FY 50002 Yn (X] > FM, 50-9 Va dX] 


induced by eacha € S, leaves f(x) unaltered. Thus, the coefficients of f(x) 
are unaltered by each o € S,. Hence, the coefficients lie in the fixed field 
K. 

Let us write the polynomial f(x) as x" + a,x"~' + a,x7?2 + +: +4,, 
where a, € K. 


Definition. If a, is the coefficient of x"~' in the polynomial f(x) = 
17, (x—y,), then (— 1)'a, is called the ith elementary symmetric function 
in y,,...Y, and is denoted by S,. 

Thus, 


S= Yt yt sled + ns 
Sp = Viy2 FViYs toe? H+ Vn- n> 


Sp = Vid2°°" Yn: 


We now prove the following theorem. 


4.1 Theorem. Let 5,,...,5, be the elementary symmetric functions in 
the indeterminates y,,....¥,. Then every symmetric function in yy Vn 
over F is a rational function of the elementary symmetric functions. Also, 
Fy, s...Y,) iS @ finite normal extension of F(S,,....5,) of degree n!, and the 
Galois group of this extension is isomorphic to S,. 


Proof. Consider the field E = F(s,,...,5,). Because K is the field of all 
symmetric functions in y,,...,y, over F, E C K. Also, because F(), ,...,),) 1S 
a splitting field of the polynomial f(x) = II7_, (x— y,), of degree n over E, 
we have, by Theorem 3.6 in Chapter 15, 


[FV 1+ Va) BE] S al. (1) 
Further, as discussed in the beginning of this section, 
[FY +n) K) 21S qh al. (2) 


But since E C K, we obtain from (1) and (2) that E = K. 
Now /(x) is a separable polynomial over E, and F(),,...,y,) iS its split- 
ting field. Thus, F(y,,...,y,) is a finite, separable, normal extension of E. 
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So 

[FW 5+ Yn) BE] = 1G(F(Y, .--Yn)/ EDI. (3) 
Because G(F(),,...,¥,)/E) is embeddable in S,,, and [F(),,...,y,): £] = n', 
we get from (3) that 

G(F(Y, 5. Ya /E) = S,,. 


Finally, the fact that K = E shows that every symmetric function can 
be expressed as a rational function of the elementary symmetric functions 
S150055,. O 


4.2 Examples 


We express the following symmetric polynomials as rational! functions of 
the elementary symmetric functions: 


(a) x? + x3 + x3. 
(b) (x, — X2)*%(x2 — X3)°(x3 — x1). 


Solution. (a) 
(x? + x3 + x3) = (x, + 2X2 + 2X3) — Wx XQ + XEX3 + X5X,) = 57 — 253, 


where s, and s, are elementary symmetnic functions of x,, x, and x3. 


(b) By simple computation it can be checked that 
" 13> y2 2~ 3° y3 I~ 3 
are the roots of x? + 3ax + 8 = 0, where 
— 5? 
am + 52, Br-s-=+—. 


Then the cubic equation whose roots are (y, — y,)’, (¥. — y3)?, and 
(Ys — yi¥ is 
(3a + yP + 9a(3a + y)? + 2787 = 0. (1) 
Now 
(X= X2P(X2 — X3P7(X3 — KP = (Y, — Y2P(2 — VsPs — yy 


= product of all the roots of (1) 
= — 27( B? + 4a°). 
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Problem 


1. Express the following symmetric functions as rational functions 
of elementary symmetric functions: 
(a) x? +x3 4+ x}. 
(b) x2 x3 + x3 x9 + x3 x2. 
(c) (xt + x3Xx¥ + x9(xj + x7). 
(d)  (%) + X27 (x2 + x3)7(3 + x1)’. 


5 Ruler and compass constructions 


The theory of fields provides solutions to many ancient geometric prob- 
lems. Among such problems are the following: 


1. Toconstruct by ruler and compass a square having the same area 
as that of a circle. 

2. Toconstruct by ruler and compass a cube having twice the vol- 
ume of a given cube. 

3. To trisect a given angle by ruler and compass. 

4. To construct by ruler and compass a regular polygon having 2 
sides. 


For these we must translate the geometric problem into an algebraic 
problem. We shall regard the plane as the coordinate plane R? of analytic 
geometry. Let Py C R?. Assume Py has at least two points. We construct an 
ascending chain of subsets P, of R?, i = 0,1,2,..., inductively as follows: Let 
P,,, be the union of P, and the set of points obtained by intersection of (1) 
two distinct lines each passing through two distinct points in P,, or (11) two 
distinct circles each with its center in P, and passing through another point 
in P,, or (ili) a line and a circle of the types described in (i) and (i1). 

Suppose that the coordinates of points in Pp belong toa subfield K of R. 
The equation of a line passing through two distinct points in Py and the 
equation of a circle whose center is in P, and that passes through another 
point in P o are 


ax+by+c=0, a,b,c € K, (1) 
x?+y2?+29x+2fy+d=0, 2efdeKk, (2) 
respectively. 


It follows then that the coordinates of the point of intersection of two 
such lines (1) lie in X. Also, the coordinates of the points of intersection of 
a line (1) and a circle (2), as well as the coordinates of the points of 
intersection of two distinct circles (2), lie in K(Va,), a, > 0, a, € K. Like- 
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wise, we get that the coordinates of points in P, lie in K(Va,,...,.Va,), 
QL, 5-54, > 0, a, € K, a € K(Va,),..., a, € K(Va,,...,V04;- 1). 


Definition. (a) A point X is constructible from Py if X € P, for some ie 
{0, 1,2,...}. 

(b) A line | is constructible from Po if it passes through two distinct 
points in some P,, i € {0,1,2....}. 

(c) A circle C is constructible from Py if its center is in some P,, and it 
passes through another point in P,, i € {0,1,2....). 


From now on whenever a point_X (a line /, a circle C) is constructible 
from Q X Q, we shall also say that the point X (the line /, the circle C) is 
constructible. 


Definition. A real number u is constructible from Q if the point (u,0) is 
constructible from Q X Q, the subset of the plane R?. 


It then follows from all this that if u € R is constructible from Q, then 
there exists an ascending chain 
Q=K,CK,CK,c °:: CK, 
of subfields K, , K3,...,K, of R such that 


(i) ue K,. 
(ii) K,=K;,_, (a), 1s isn, where a? € K,_,. 


Thus, [K,: K,;_,] = 2, and, hence, [K,:Q] = 2”, msn. 
So we have shown 


§.1 Theorem. Let u € R be constructible from Q. Then there exists a 
subfield K of R containing u such that [K:Q] = 2™ for some positive 
integer m. 


§.2 Theorem. Let K be the subset of R consisting of numbers con- 
structible from Q. Then K is a subfield containing square roots of all 
nonnegative numbers in K. 


Before we prove this theorem, we prove a series of lemmas. 


§.3 Lemma. The following are equivalent statements: 


(i) a€R is constructible from Q. 
(ii) (a,0) is a constructible point from Q X Q. 
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(iti) (a,a) is a constructible point from Q X Q. 
(iv) (0,a) is a constructible point from Q X Q. 


Proof. (i) > (ii) Definition. 

(ii) => (iii) The circle (x — a + y? = a? is constructible because its 
center (a,0) is a constructible point, and it passes through a constructible 
point (0,0). Also, the line x = y is constructible because it passes through 
constructible points (0,0) and (1,1). The point (a,a) is clearly a point of 
intersection of the circle and the line. Hence, (a,a) is constructible. 

(iii) => (iv) The circle x? + y? = 2a? is constructible because its center 
(0,0) is constructible, and it passes through a constructible point (a,q). 
Also, the line y = — x is constructible because it passes through two dis- 
tinct constructible points (0,0) and (1,— 1). One of the points of intersec- 
tion of this circle and this line is (—a@,a). This implies that (0,a) is a 
constructible point because it is the intersection of the constructible lines 
y = a[which passes through two distinct constructible points (— a,a) and 
(a,a)] and x = 0. 

(iv) = (11) Follows by symmetry. O 


Henceforth, whenever we say that a real number a is constructible, we 
mean that a is constructible from Q. 


5.4 Lemma. [fa is a constructible number, then x = a and y = a are 
constructible lines. 


Proof. \f a=0, then x = 0 is clearly constructible. So let a # 0. Then 
x =a passes through two distinct constructible points (a,0) and (a,a). 
Hence, x = a is constructible. Similarly, y = a is constructible. O 


5.5 Lemma. If a and b are constructible numbers, then (a,b) is a 
constructible point. 


Proof. (a,b) is the intersection of the constructible lines x = a and y = b. 
0 


5.6. Lemma. /fa and bare constructible numbers, then a + bare also 
constructible. 


Proof. (a + 6,0) are the points of intersection of the constructible line 
y= 0 and the constructible circle (x — a)? + y? = 6? (the center (a,0) is 
constructible; the point (a,b) through which the circle passes is construct- 
ible). O 
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5.7 Lemma. [fa and b are constructible numbers, then 


(i) abis constructible. 
(ii) afb, b #0, ts constructible. 


Proof. (i) The line ay = — x + abis constructible because it passes through 
constructible points (0,5) and (a,b — 1). The intersection of this line with 
the constructible line y = 0 is (ab,0). Hence, ab is constructible. 

(Note that we have used the fact that if b is constructible, then b — | is 
also constructible — a consequence of Lemma 5.6.) 

(11) If a = 0, then it is clear. So let a # 0. Then the line bx = a — }' is 
constructible because it passes through two distinct constructible points: 
(0,a) and (a,a(1 — b)). The intersection of this line with the constructible 
line y = 0 is (a/b,0). Hence, a/b is constructible. © 


5.8 Lemma. If a > 0 is constructible, then Va is constructible. 


Proof. The point (1 Va) is a point of intersection of the constructible circle 


I+a\ = (Ltay’ 
(« 2 J+ y= 2 


{which passes through the constructible point (0,0) and which has con- 
structible center ((1 + a)/2,0)] and the constructible line x = |. Thus, 
(1,Va) is a constructible point. 

Next, (0,2Va) is also a constructible point because it is a point of 
intersection of the constructible circle (x — 1)? + (y— Va¥ = a+ 1 and 
the constructible line x = 0. Therefore, 2Va is a constructible number. 
Then by Lemma 5.7, Va is a constructible number. © 


Proof of the theorem. Follows from Lemmas 5.6-5.8. 0 


5.9 Theorem. If ueK,,, where Kj =QCK,c K,c-:-cK,, is an 
ascending tower of fields K; such that [K;:K;_ ,] = 2, then u is construc- 
tible. 


Equivalently, if [Q(u):Q] = 2' for some t > 0, then uis constructible. 


Proof. Since rationals are constructible, the proof follows from Lemmas 
5.6-5.8. 


Definition. An angle a is constructible by ruler and compass if the point 
(cos a, sin @) is constructible from Q X Q. 
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5.10 Remark. The point (cosa, sina) is constructible from Q x Q if 
and only if cos @ is a constructible number (equivalently, if and only if 
sin a is a constructible number). 


Proof of the remark. Let (cos a, sin a) be a constructible point. Then 
(2 cos a, 0) is a point of intersection of the constructible circle (x — 
cosa} + (y — sina = | and the constructible line y = 0, so (2 cos a, 0) 
is a constructible point. Thus, 2 cos @ is a constructible number. So 
by Theorem 5.2, cos @ is a constructible number. Conversely, assume 
that cos @ is a constructible number. Then by Theorem 5.2, sin @ is 
also a constructible number. This yields, by Lemma 5.3, that the points 
(cos a, 0), (cos a, cos a), (0. sin a), and (sin a, sin a) are constructible 
points. This means that the lines x = cos aand y = sina are constructible 
lines. Hence, their intersection, namely, the point (cos a, sin @), 1S a 
constructible point. The statement in parentheses can be proved the same 
way. O 


5.11 Examples 


(a) Problem of squaring a circle 


It is impossible to construct a square equal in area to the area of a circle of 
radius 1. 


Solution. Assume we have a circle of unit radius. If a is the side of the 
square whose area is equal to that of this circle, then a? = z. But since z is 
not algebraic over Q (Hardy and Wright 1945), a? and, hence, a is not 
algebraic over Q. So [(Q(a):Q] # 2” for any positive integer m. Hence, by 
Theorem 5.1, a is not constructible by ruler and compass. 


(b) Problem of duplicating a cube 


It is impossible to construct a cube with a volume equal to twice the 
volume of a given cube by using ruler and compass only. 


Solution. We can assume that the side of the given cube is 1. Let the side of 
the cube to be constructed be x. Then x? — 2 = 0. So we have to construct 
the number 2!) (the real cube root of 2). Because x3 — 2 is irreducible 
over Q, 


[Q(2'3): Q] = 3 ¥ a power of 2. 


Thus, by Theorem 5.1, 2'/ is not constructible from Q by ruler and 
compass. 
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(c) Problem of trisecting an angle 
There exists an angle that cannot be trisected by using ruler and compass 
only. 


Solution. We show that the angle 60° cannot be trisected by ruler and 
compass. Now if this angle can be trisected by ruler and compass, then the 
number cos 20° is constructible from Q. This is equivalent to the con- 
structibility of 2 cos 20° from Q. Set a = 2 cos 20°. Then from cos 36 = 
4 cos?6 — 3 cos 8, we deduce a3 — 3a — | = 0. Because the polynomial 
x3 — 3x — 1 € Q[x] is irreducible over Q and has a root a, it follows that 


[Q(a):Q] = 3 # power of 2. 


Thus, by Theorem 5.1, @ = 2 cos 20° (or, equivalently, an angle of 20°) 
cannot be constructed by ruler and compass from Q. This completes the 
solution. 


(d) Problem of constructing a regular n-gon 


; 2n. 
A regular n-gon is constructible (equivalently, the angle — 1s construc- 
n 


tible) if and only if g(n) is a power of 2. 


2x... 2n ete 
Solution. Let w = cos — + isin —, where w is a primitive nth root of 
n n 


. e 2n 2n 
unity. Then w + @ = 2cos —. Set u=cos—. To show that u 1s con- 
n 


n 
structible, we need to prove that [Q(u):Q] = 2", k>0. Consider the 
following tower: 


Q(w) 
QW) 
Q 
Now (« — cos any = — sin? = and so w* — 2cos an + 1 =0. Thus 
w satisfies " " : 


x? — (2 Cos = \s + 1EQ(u), 


which is clearly an irreducible polynomial over Q(u), proving that 


364 


Applications of Galois theory to classical problems 


[Q(w):Q(u)] = 2. Since by Theorem 1.4 [Q(w): Q] = o(n), [(Q(u):Q] = 


+9(n). This shows that u is constructible if and only if p(n) is a power 


of 2. 


Problems 


1. 


2. 


3. 
4. 


Show that the angle 27/5 can be trisected using ruler and com- 


pass. 
Show that it is impossible to construct a regular 9-gon or 7-gon 


using ruler and compass. 
Show that it is possible to trisect 54° using ruler and compass. 
Prove that the regular 17-gon is constructible with ruler and 


compass. 
Find which of the following numbers are constructible: 


(i) /3 +1. (ii) x? +1. 
(iii) / /3-—14+1. (iv) 3/241. 
(v) ¥/2+/5. 


PART V 


Additional topics 


CHAPTER 19 


Noetherian and artinian modules and rings 


| Hom,(@M,, @©M,) 


We begin this chapter by proving a result that is quite useful in studying 
the ring of endomorphisms of a direct sum of modules and that has 
applications in subsequent sections. Unless otherwise stated, all modules 
considered in this chapter are left modules. 


1.1. Theorem. Let M=@2*,M, bea direct sum of R-modules M,. 
Then 


Hom,(M, »M,) Hom a(M,,™M,) — Hom ,(M,,M;) 

Hom,(M, »M;) Hom,(M,,M,) oT Hom,(M,,M,) 
Hom,(M,M) = 

Hom,(/,,M,) Hom,(M,,M,) °°: Hom,(M,,M,) 


as rings. (The right side is a ring T, say, of k X k matrices f = (f,,) under 
the usual matrix addition and multiplication, where f,, € Hom,(M,,M,)). 


Proof. Let 6 € Hom,(M,M). Let A,:M,— M and 2,:M— M, be the 
natural inclusion and projection mappings, respectively. Then 2,0/, € 
Hom ,(M,,M,). Define a mapping 0: Hom,(M,M) — T by setting o(¢) 
to be the k X k matrix (2,A,) whose (i,j) entry is 7,pA,, where d € 
Hom ,(M,M). We proceed to show that o is an isomorphism. So let 
o,y © Hom,(M,M). Then 


a(h + y) = (xd + y)A,) = (2,HA,) + (2 wA,) = o(f) + o(y). 
367 


368 Noetherian and artinian modules and rings 
Further, 


k 
o(p)a(y) = (x, dA, 2, yA,) = (> nbinwvi,), 


by definition of multiplication of matrices. But since 24. ,A,2,= 1, it 
follows that 


a(d)o(y) = (x, dyA,)= o(py). 


Therefore, o is a homomorphism. 

To prove that a is injective, let o(@) = (x,6A,) = 0. Then 2,6, = 0, 
1 si, j sk. This implies 24, 2A, = 0. But since 2,2, = 1, we obtain 
oa, = 0, 1 sj Sk. Ina similar fashion we get @ = 0, which proves that o 
is injective. To prove that a is surjective, let f= (/,,) € T, where /,,: M, > 
M, isan R-homomorphism. Set @ = 2, ,A,f,,2;. Thend@ € Hom,(M,M). 
By definition of co, o(d) is the k Xk matrix whose (s,t) entry is 
N(Z, ASMA, = Sor, Because 2,A, = 6,,. Hence, o() = (,,) =f. Thus, 
a is also surjective. O 


Problems 


1. Let M=M, © M, be the direct sum of simple modules M, and 
M, such that M, # M,. Show that the ring End,(M) is a direct 
sum of division rings. [Hint: Hom,(M, ,44,) = 0, etc.] 

2. Let M=M, © M, be the direct sum of isomorphic simple mod- 
ules M,, M,. Show that End p(M) = D;, the 2 X 2 matrix nng 
over a division ring. 


2 Noetherian and artinian modules 


Recall that an R-module MM is finitely generated if M is generated by a 
finite subset of M; that is, if there exist elements x,,...,x, © M/ such that 
M =(x,,...,X,). This is equivalent to the statement: If 4 = 2,.,M, isa 
sum of submodules M, then there exists a finite subset A’ of A such that 
M = > .c'M, We now define a concept that is dual to that of a finitely 
generated module. 


Definition. An R-module M is said to be finitely cogenerated if, for each 
family (M.)acr Of submodules of M, 


()\M,=0= ()M,=0 


acA acA’ 


for some finite subset A’ of A. 
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We show that finitely generated and finitely cogenerated modules can 
be characterized as modules that have certain chain conditions on their 


submodules. 


Definition. An R-module M is called noetherian (artinian) if for every 
ascending (descending) sequence of R-submodules of M, 


M,CM,CM,C >: (M,2M,52M,D°>:: ), 
there exists a positive integer k such that 
M, * Mya) = Misr ™ ° 
If M is noetherian (artinian), then we also say that the ascending (descena- 


ing) chain condition for submodules holds in M, or M has acc (dcc) on 
submodules, or simply that M has acc (dcc). 


Because the ring of integers Z is a principal ideal ring, any ascending 
chain of ideals of Z is of the form 
(n) C (m,) C (m2) C +, 
where n,n,,n2,... are in Z. Because (n;) S(n,;,,) implies n;,,]n,, any 
ascending chain of ideals in Z starting with 1 can have at most 7 distinct 


terms. This shows that Z as a Z-module is noetherian. But Z as a Z- 
module has an infinite properly descending chain 


(n) D (n?) D (n?)D ++, 
showing that Z is not artinian as a Z-module. 


Before we give more examples, we prove two theorems providing us 
with criteria for a module to be noetherian or artinian. 


2.1 Theorem. For an R-module M the following are equivalent: 


(i) Mis noetherian. 
(ii) Every submodule of M is finitely generated. 
(iii) Everynonempty set S of submodules of M has a maximal element 
(that is, asubmodule Mg in S such that for any submodule No in S 
with No D Mg we have No = M,). 


Proof. (i) => (ii). Let N be a submodule of M. Assume that Nis not finitely 
generated. For any positive integer k let a,,....a,€N. Then N# 
(a,,...,@,). Choose a,4, € N such that a,,, € (a,,...,@,). We then obtain 
an infinite properly ascending chain 


(2,) & (@y 442) © 2° Gy yeep) & (Ay yes Opa) Go 
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In view of the importance of noetherian and artinian rings in them- 
selves, we rewrite Theorems 2.1 and 2.2 for rings as follows: 


2.3 Theorem. Let R be a ring. Then the following are equivalent: 


(i) Ris noetherian (artinian). 
(ii) Let A be any left ideal of R. Then A (R/A) is finitely generated 
(cogenerated). 
(iti) Everynonempty set S of left ideals of R has a maximal (minimal) 
element. 


In particular, every principal left ideal ring is a noetherian ring. 


2.4 Examples 


(a) Let V be an n-dimensional vector space over a field F. Then Vis both 
noetherian and artinian. For, if Wis a proper subspace of V, then dim 
W < dim V = n. Thus any properly ascending (or descending) chain of 
subspaces cannot have more than n + | terms. 

(b) Let A be a finite-dimensional algebra with unity over a field F. Then 
A as aring is both left and right noetherian as well as artinian. To see this, 
let [A: F] =n. If we observe that each left or right ideal is a subspace of 
A over F, it follows that any properly ascending (or descending) chain 
cannot contain more than n + | terms. 

In particular, (1) if G is a finite group and F a field, then the group 
algebra F(G) is both a noetherian and an artinian ring; (11) the mm X m 
matrix ring F,, over a field Fis also a noetherian and artinian ring; (i11) the 
ring of upper (as well as lower) triangular matrices over a field F is both 
noetherian and artinian. 

(c) Let R = F[x] be a polynomial ring over a field F in x. Because F [x] 
is a principal ideal domain, it follows by Theorem 2.3 that F[x] is a 
noetherian ring. But F [x] is not artinian, for there exists a properly de- 
scending chain of ideals in R, namely, 


RDRxDRx?7D °°. 


However, every proper homomoprphic image R/A, where A is a nonzero 
ideal in R, is artinian, because we know that R is a PID. Hence, A = 
(p(x)), p(x)E F[x). Let p(x) =aygta,xt+ -** +a,x". Then 
F{x]/(p(x)) is a finite-dimensional algebra over F with a_ basis 
{1,x,....%"-'}. Hence, by Example (b), R/A = F[x]/( p(x)) is an artinian 
ring. 

(d) Let D, be the 2 X n matrix ring over a division ring D. Then D, is an 
n?-dimensional vector space over D, and each left ideal as well as each 
right ideal of D, is a subspace over D. Thus, any ascending or descending 
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chain of left (as well as right) ideals cannot contain more than n? + | 
terms. Thus, D, is both noetherian and artinian ring. 
(e) Let p be a prime number, and let 


r=29)= {7% Qos 2 <1} 


be the ring where addition is modulo positive integers, and multiplication 
is trivial; that is, ab = O for all a,b € R. Then 
(i) Each ideal in R is of the form 


12 pr-1 
A,™ Eo pete) SCOD 0 ’ 
: (3s pe? pk 
where k is some positive integer. 
(ii) R is artinian but not noetherian. 


Solution. (i) Let A * (0) be any ideal of R, and let k be the smallest positive 
integer such that for some positive integer m, m/p* € A. Consider n/p’, 
with i = k and (n, p) = 1. We assert that n/p! € A. Now n/p! € A implies 
np'—*/p' = n/p*‘eA. Also, by choice of k, 1/p*~'€A. Because (n, p) = 1, 
we can find integers a and b such that an + bp = 1. Then from n/p*, 
1/p*-! E A, we have that na/p* (reduced modulo whole numbers) and 
bp/p* (reduced modulo whole numbers) lie in A. Hence, 1/p* € A, a 
contradiction. Thus, no n/p', i = k, (n, p) = 1 can lie in A. Hence, 


[ho BH) 


This ideal is denoted by A,_,. 

(iii) Because each ideal contains a finite number of elements, each 
descending chain of ideals must be finite. Hence, 2 is artinian. 

Clearly, the chain 


A,C A,CA,C °° 


is an infinite properly ascending chain of left ideals, showing that R is not 
noetherian. Note that although each ideal A * R is finite and, hence, 
finitely generated, R itself is not finitely generated. 


2.5 Theorem. Every submodule and every homomorphic image of a 
noetherian (artinian) module is noetherian (artinian). 


Proof. Follows at once from Theorem 2.1 (Theorem 2.2). O 
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2.6 Theorem. Let M bean R-module, and let N be an R-submodule of 
M. Then M is noetherian (artinian) if and only if both N and M/N are 
noetherian (artinian). 


Proof. Let N and M/N be noetherian, and let K be any submodule of M. 
Then (K + N)/N is a submodule of M/N, and, hence, it is finitely gener- 
ated (Theorem 2.1). But then (K + N)/N = K/(NN K) implies K/((NNM 


K) is finitely generated, say 


K a = = K 
Wak (Xt +(%,,), XE WK’ 


Then 
K=(x,)+ °°: +(x,) + NOK, x,€ K. 


Further, because N is noetherian, its submodule NM K is finitely gener- 
ated, say by y,,...,. ¥, € NO K. This implies 


K= (x) + 00) + (Xm) + (WH oe + (Vn): 


Hence, K is finitely generated, so M is noetherian. The converse is 
Theorem 2.5. The proof for the artinian case is similar. O 


An equivalent statement of Theorem 2.6 in the terminology of exact 


sequences is as follows. 
Let 0 M, ~M-—M,—0 be an exact sequence of R-modules. 


Then M is noetherian (artinian) if and only if both M, and M, are noeth- 
erian (artinian). 


2.7 Theorem. A subring of a noetherian (artinian) ring need not be 
noetherian (artinian). 


Proof. For the artinian case the ring of rational numbers Q is an artinian 
ring, but its subring Z is not an artinian ring. 

For the noetherian case, the ring of 2 X 2 matrices over the rational 
numbers Q is a noetherian ring, but its subring [J 3] is not noetherian - 
that is, not left noetherian (see Example 2.15(e)]. 0 


2.8 Theorem. Let R,, 1 s isn, be a family of noetherian (artinian) 
rings each with a unity element. Then their direct sum R= @="”_,R, is 
again noetherian (artinian). 
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Proof. We know that each left ideal A of R 1s of the form A, ©... © A,, 
where A, are left ideals in R,. Soifa left ideal B = B, © ... © B, of Rissuch 
that A C B, then it is clear that A,C B,, 1 Ss issn. Hence, any properly 
ascending (descending) chain of left ideals in R must be finite because 
each R, is noethenian (artinian). O 


2.9 Theorem. If J is a nil left ideal in an artinian ring R, then J is 
nilpotent. 


Proof. Suppose J* # (0) for any positive integer k. Consider a family 
(J, J?, J3,...}. Because R is artinian, this family has a minimal element, say 
B=J™. Then B?=J2"C J” =B implies B? = B. Consider another 
family 

¥ =({A|A is a left ideal contained in B with BA # (0)). 


Then ¥ # © because BE ¥. Let A be a minimal element in ¥. Then 
BA # (0). This implies there exists an element a € A such that Ba # 0. 
But Ba C A and B(Ba) = B2a = Ba # 0. Thus, Ba € ¥. Hence, by mini- 
mality of A, Ba = A. This gives that there exists an element 5 € B such 
that ba = a. This implies b‘a = a for all positive integers i. But because bis 
a nilpotent element, this implies a = 0, a contradiction. Therefore, for 
some positive integer k, J‘ = (0). O 


2.10 Lemma. Let R be a noetherian ring. Then the sum of nilpotent 
ideals in R is a nilpotent ideal. 


Proof. Let B = 2 j-,A, be the sum of nilpotent ideals in R. Because R is 
noetherian (i.e., left noetherian), B is finitely generated as a left ideal. 
Suppose B =(x,,...,X,,). Then each x, lies in the sum of finitely many 
A,’s. Hence, B is contained in the sum of a finite number of A,’s, say 
(after reindexing if necessary) A,,...,A,. Thus, B = A, + --- + A,. Then 
by Problem 1 of Section 5 in Chapter 10, B is nilpotent. 0 


Recall that if S is any nonempty subset of a ring R, then ((S) = 
(x € R|xS = 0) is a left ideal of R called the /eft annihilator of Sin R. 


2.11 Theorem. Let R be a noetherian ring having no nonzero nilpotent 
ideals. Then R has no nonzero nil ideals. 


Proof. Let N be a nonzero nil ideal in R. Let ¥ = {/(n)|n € N, n #0) bea 
family of left annihilator ideals. Because R is noetherian, ¥ has a maximal 
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member, say /(n). Let x E R. Then nx € N, so there exists a smallest 
positive integer k such that (nx) =0. Now, clearly, I(n) ¢ I((nx)*” *). 
Because (nx)! £0, I((nx~")eF. But then by maximality of I(n), 
I(n) = I((nx}*~ '). Now 
(nxt = O0=>nxel((nx}*~ !) = I(n)=> nxn = 0. 

Now (RnR) = RnRRnR = 0. Therefore, by hypothesis, RnR = 0. If 
1 € R, then n = 0, a contradiction. So in this case we are done. Otherwise, 
consider the ideal (n) = nR + Rn + RnR + nZ generated by n. Set A = 
nR + Rn. Because nxn = 0, for all x € R, A? = 0. Thus, () = A + nZ. 
But then if n* = 0, we have (A + nZ)* = 0. Therefore, by hypothesis, 
A + nZ = 0, which gives n = 0, acontradiction. Hence, R has no nonzero 
nil ideals. O 


Remark. Indeed, one can similarly show that R has no nonzero right or 


left nil ideals. 
Next we show that a nil ideal in a noetherian ring is nilpotent. 


2.12 Theorem. Let N be a nil ideal in a noetherian ring R. Then N is 
nilpotent. 


Proof. Let Tbe the sum of nilpotent ideals in R. Then R/T has no nonzero 
nilpotent ideals, for if A/T is nilpotent, then (A/7)” = (0) implies 
A/T = (0); so A” C T. But since, by Lemma 2.10, 7 is nilpotent, there 
exists a positive integer k such that (A”)* = (0). Hence, A itself is nilpo- 
tent, so A C T. This implies A/T = (0). 

Consider the nil ideal (N + 7)/T in R/T. By Theorem 2.11, (N+ T)/ 
T = (0). This implies N C T, which is a nilpotent ideal. Hence, N is nilpo- 
tent. O 


2.13 Remark. /fRis an artinian ring with identity, then itis known that 
R ts noetherian. 


2.14 Theorem (Hilbert basis theorem). Let R be a noetherian ring. 
Then the polynomial ring R[x] is also a noetherian ring. 


Proof. Let ¥ and ¥’ be the families of left ideals of R and R[x], respec- 
tively. Let n be a nonnegative integer. Define a mapping @,: ¥’ —> F, 
where 

, (1) = {ae R| Jax" + bx"~! + ---El,a #0} U {0}. 


It is easy to verify that b,(/) € ¥. We claim that if /, J € ¥’ with / C Jand 
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,(1) = >, (J) for all n = 0, then J = J. Let 0 # f(x) € J be of degree m. 
Because @,,(/) = ¢,,(J), there exists g,,(x) € J with leading coefficient the 
same as that of f(x), and {(x) — g,,(x) is either 0 or of degree at most 
m — 1. Suppose f(x) — g,,(x) # 0. Because f(x) — g,,(x) € J, we can simi- 
larly find g,,_,(x) € J such that f(x) — g,,(x) — g,,—,(x) € Jand is either 0 
or of degree at most mm — 2. Continuing like this, we arrive, after at most m 
steps, at 


I(X) = &m(X) — Bm—1(X) — °°* — B(x) = 0, 


where g,,(X),2—,(X),... € J. But then f(x) € J, which proves that J = J, as 


claimed. 
Let A, C A, C A, C °*** bean ascending sequence of left ideals of R[x]. 


Then for each nonnegative integer 7, 

$,(A)) Cc d,(A;) Cc od, (A3) 
is an ascending sequence of left ideals of R; hence, there exists a positive 
integer k(n) such that 


b(Angny) = bnlAriny+ 1) ea (1) 


Further, because R is noetherian, the collection of left ideals (¢,(A,)), 
n & N, iG N, has a maximal element, say ¢,(A,) (Theorem 2.3). Then 


},(A,)=,(A,) (for all n = p) 
=¢,(4,) (forall n= p, j=). 
Therefore, we may choose k(n) = g for all n = p in (1). Moreover, if 
s=k(1) -:-k(p— 1)q, then @,(A,) = @,(A,4,)™ °** for all nEN. 
Hence, by the result proved in the first paragraph, A, = A,,,; ™ °°: 
Therefore, R[x] is noetherian. ( 


2.15 Examples 


(a) If R is noetherian, then each ideal contains a finite product of prime 
ideals. 


Solution. Suppose that the family ¥ of ideals in R that do not contain any 
product of prime ideals is nonempty. Then by Theorem 2.3, ¥ has a 
maximal element, say A. Because A € §, A is not a prime ideal. Hence, 
there exist ideals B and C of R such that BCC A, but BE A, CEA. 


Consider 
(B + AXC + A)C BC+ BA+ AAC+ 42CA. 
Because, B + A > A, and C+ AA, both B+ 4A and C+ 4 contain a 
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product of prime ideals, and, hence, (B + AXC + A) containsa product of 
prime ideals. This implies that A contains a product of prime ideals, a 
contradiction. Hence, ¥ = ©. 


(b) A Boolean noetherian ring is finite and is indeed a finite direct 
product of fields with two elements. 


Solution. Let R be a Boolean ring that is noetherian, and let P ¥ R be a 
prime ideal in R. Consider the homomorphic image R/P of R. Clearly, for 
all x = x + PE R/P, x? = x. Because R/P is an integral domain and the 
only idempotents of an integral domain are zero and (possibly) the unity 
element, it follows that if P * R, then R/P has only two elements. Thus, 
R/P is a field isomorphic to Z/(2), so P is a maximal ideal. Now by 
Example 2.15(a), there exist prime ideals P,,...,P,, such that 
P,P,---P,,=(0). But then P;AP,0---OP,,<¢P,;, 1<ism, gives 
(P,NP2N + OP, )"C P,P, -*: P,,=(0). Thus, (P;,NP,N«: /N 
P_,)” = (0), and because R is Boolean, P;N P,N +++ N P,, = (0). Further, 
because P, are also maximal, P; + P; = R, i #j. Then by Example 3.4(a) 
of Chapter 10, we get 


R= R/P, X R/P,X +** X R/P,, * Z(2) X ZM(2)X +++ X Z/(2). 


This completes the solution. Incidently, this also gives that the cardinality 
of R is 2* for some positive integer k. 


Remark. The above problem shows that any commutative noetherian 
ring R with no nonzero nilpotent elements in which each prime ideal 
P # R is maximal is a finite direct sum of fields. 


(c) If R is a noetherian ring, then ab = 1, a, be R, if and only if ba = 1. 
Solution. Let ab = | and ba ¥ 1. Define 
e,, = bal! — bial, i,j = 1,2,3..... 
It is easy to check that 


Cea = Ane i- 
This gives rise to an infinite set ({e,,|i = 1,2,3,...} of orthogonal idempo- 
tents in R. Set e,, =f, for all i = 1,2,3,.... Thus, we get an infinite properly 
ascending chain of left ideals 

RAGRAORLGRAORLORLG **, 


which is a contradiction. Hence, ba = 1, which completes the solution. 
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(d) Any right ideal of the ring R = (§ 8) can be generated by at most 
two elements; hence, R is right noetherian. 


Solution. Let A be a nonzero right ideal of R. Let 


X= {neZ (; “eA for some x, yeQ 
y 


0 


Then it is clear that X is an ideal in Z. Hence, X¥ = (n,) for some ny € Z, 
because Z is a principal ideal ring. 
Case 1. X # (0). We claim 


—{" ! —(%o !\p. 
A (" ‘YR or A (" 0) 


that is, A is a principal right ideal of R generated by 


a 


Then 


ng a\(k x\_ (mk noxtay 
({ $)(c “| ("s by ea 


for all k € Z, x,y EQ. Taking k= 1, y= 1/5, x = (1 — a/b)/no, we see 


that 
No | 
(re Neu 


Next, let 


nom ‘) 
0 4d 
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be an arbitrary element of A. Then 


("™ c) = (%e 1b a) 
0 ad 0 1/\0 d 


Hence, 


—{"% ! 
a=(te ')e 


In case the (2,2) entry of each element of A is 0, then, exactly as before, 
it follows that 


—([" |! 
y (" Jr 

Case 2.X = (0). Then either (i) A is a principal right ideal, or (ii) A is not 
a principal right ideal. 

(i) Suppose A is a principal right ideal generated by some nonzero 
element of the form (8 4), 8,y € Q. Then A = (8 4)R; that is, 


4-{(° %)[»<a} 


which includes the particular cases A = (9 3), when B = 0, and A =(° 9), 
when y = 0. 

(11) A is not a principal right ideal. In this situation there must exist an 
element (§ £) € A such that 8 # 0, y # 0. Otherwise, A becomes a princi- 
pal right ideal as shown next. Suppose for all (§ 4) € A, y = 0. Then 


(0 Sea(o Blo im)-(0 ode 


t 4 *)=(6 res for all yE OQ. 


So A = (29), the principal right ideal of R generated by (9 }). Similarly, if 
for all (§ 4) € A, B = 0, then 


4-5 t)#-(0 9} 


Hence, in possibility (ii) there exists (§ 4) € A, 8 * 0, y ¥ 0, such that 


42(0 sR {(0 3 )Pee} 
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So we can choose 
0 £ 
k A EA 


such that £’/B * y’/y, that is, B’y — By’ #0. Hence, given arbitrary 
P.q E Q, there exist x, y € Q such that p = Bx + B’y,q = yx + y’y. There- 


(6 )a(8 W828 AY sew 


Hence, A = (9 Q). 
Indeed, A is a right ideal generated by e,, and e,.. Summarizing, if A isa 
nontrivial right ideal in R = (§ Q), then A is one of the following: 


l NoZ Q 
(i) A= (" ) R= ( (m9 a nonzero fixed element 
01 0 Q of Z). 


(ii) A= (" i) R= ("7 2) 
(ii) A= ¢ 4 R= {(2 | ye Q| (where f,y are fixed 


oy elements of Q). 


In particular, A = (99) or (§ 8) according to whether y = 0 or 
B= 0. 


(iv) a=(° g) 


The first three are principal right ideals, the last one is not principal but 
can be generated by two elements. Thus, by Theorem 2.3, R 1s right 
noetherian. 


The following is an example of a ring which is right noetherian but not 
left noetherian. 
(e) The ring 


e-[E Sl- {2 &] 


is not left noetherian but right noetherian. 


2€7,bceQ| 
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Solution. It is easy to verify that for each positive integer k, 


A,= {| ea | me z} 


is a left ideal of R and A, ¢ A,,,. Thus, there exists an infinite properly 
ascending chain of left ideals of R, namely, 


A,C A,C A3C a 


Hence, R is not left noetherian, but R is right noetherian, as shown in 
Example (d). We may invoke Theorem 2.6 to give the following alterna- 
tive argument (perhaps a nice one too) to prove that R is nght noetherian. 

Let A = [8 8] be a right ideal of R. Then R/A = [79] as right R- 
modules. Now it can be easily verified that nonzero R-submodules of 
[2 QJare (‘7 2], when 7 is a fixed integer [see Example 1.5(e), Chapter 10]. 
Clearly, any ascending chain of R-submodules of the R-module [? 9] gives 
rise to an ascending chain of ideals in Z. It then follows that R/A = [2 9] is 
noetherian. Also, A = [§ 9] as an R-module has no proper submodules. 
Thus, A is a noetherian R-module. But then by Theorem 2.6, R is noeth- 


erian. 


Problems 


1. Let R,,...,R, be a family of noetherian rings. 

Show that their directsum R = R, © -:: © R, is again noether- 
ian. Prove the analogous result for modules. 

2. Prove that the intersection ofall prime ideals in a noetherian ring 
is nilpotent. 

3. Show that every principal left ideal ring is noetherian. 

4. Show that if R,, i= 1,2,..., is an infinite family of nonzero rings 
and if R is the direct product (or sum) of this family (R,), i = 
1,2,.... then R cannot be noetherian. 

5. Let R be acommutative noetherian ring, and let S be a multipli- 
cative subset of R. Show that the ring of fractions R, is also 
noetherian. 

6. Let R bea noetherian ring. Show that the ring of n X n matrices 
R,, over R is also noetherian. 

7. Let R be a noetherian integral domain. Then show that for all 
0*cER, Rc is large [in the sense that RecN A # (0) for all 
nonzero left ideals A in R]. [Hint: lf RcN A = (0), then 27_, Ac” 
is a direct sum.] 
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8. Let R be a left artinian integral domain with more than one 
element. Show that R is a division ring. 

9. Ifaring R contains a division ring D as a subring such that R asa 
left vector space over D is finitely generated, show that R is left 
artinian. 

10. (a) If X is any left ideal of a ring R = A,, the n X n matrix ring 
over A with unity, show that e, ,K is a submodule of the free 
A-module e,, R. If L is any A-submodule of e, ,R, show that 
2,e;,L is a left ideal of R. Verify that these correspondences 
give a 1-1 onto mapping that preserves inclusions. 

(b) Show that the matrix ring over an artinian (noetherian) ring is 
also artinian (noetherian). 

11. Show that a left artinian ring cannot possess an infinite direct 
sum ®3A, of left ideals A, of R. 


3 Wedderburn - Artin theorem 


In this section our aim is to prove that any left artinian ring R with unity 
without nonzero nilpotent ideals is a finite direct sum of matrix rings over 
division rings. Because any left artinian ring must contain minimal left 
ideals, it is appropriate to know their form. The following lemma is a 
useful fact. 


3.1 Lemma. Let A be a minimal left ideal in a ring R. Then either 
A? = (0) or A = Re, where e is an idempotent in R. 


Proof. Suppose A? # (0). Then there exists a € A such that Aa # (0). But 
Aa C A, and the minimality of A shows Aa = A. From this it follows that 
there exists e € A such that ea = a, and, clearly, e # 0 because a * 0. 
Moreover, e2a = ea or (e? — e)a = 0. If B = {c € Alca = 0}, then Bisa 
left ideal of R such that BC A and B #4, Therefore, we must have 
B= (0). But then (e? — e)a = 0 implies e? = e. Now ReC A and Re # 
(0), because 0 # e = e* E€ Re. Accordingly, Re = A, and the proof is com- 


pleted. O 
We now prove the celebrated Wedderburn - Artin theorem. 
3.2 Theorem (Wedderburn - Artin). Let R be a left (or right) artinian 


ring with unity and no nonzero nilpotent ideals. Then R is isomorphic toa 
finite direct sum of matrix rings over division rings. 
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Proof. We first establish that each nonzero left ideal in R is of the form Re 
for some idempotent e. So let A be any nonzero left ideal. By virtue of the 
dcc on left ideals in R, A contains a minimal left ideal 14. By Lemma 3.1 
either 142 = (0) or M = Re for some idempotent e. If 44? = (0), then 
(MRY¥ = (0); so, by hypothesis, MR = (0), which gives M = (0), a contra- 
diction. Hence, M = Re. This yields that each nonzero left ideal contains 
a nonzero idempotent. Consider now a family ¥ of left ideals, namely, 


¥ = {R(1 — e)N Ale is a nonzero idempotent in A}. 


Clearly, ¥ is nonempty. Because R is left artinian, ¥ has a minimal 
member, say R(1 — e)N A. We claim R(1 — e)N A = (0). Otherwise, 
there exists a nonzero idempotent e, € R(1 — e) N A. Clearly, e,e = 0. Set 
e’=e+e, — ee,. It is easy to verify that e’e’ = e’ and e,e’ #0. It is 
also obvious that R(1 — e’)NACR(1l—e)NA. But e,e’ #0 gives 
e,¢ R(il—e’)NA and e,E R(l—e)NA. Hence, R(l—e’)NAG 
R(1 — e) NA, a contradiction to the minimality of R(1 — e)NA in F. 
This establishes our claim that R(1 — e)N A = (0). Next, let a € A. Then 
ali-—e)E R(1—e)NA=(0). Thus, a=ae. Then AD ReD Ae=A 
proves that A = Re, as asserted. 

Let S be the sum of all minimal left ideals in R. Then S = Re for some 
idempotent e. If R(1 — e) ¥ (0), then there exists a minimal left ideal A 
contained in R(1 — e). But then A C Re/N R(1 — e) = (0), a contradic- 
tion. Hence, R(1 — e) = (0), which proves that R = S = 2,<,A,, where 
(A,), i € A, is the family of all minimal left ideals in R. By Theorem 4.3 of 
Chapter 14 there exists a subfamily (A,), i € A’, of the family of the 
minimal left ideals such that R = ©Z je,-A;. Let] =e, + +> +e,,04# 
e, € A, ,i,€ A’. Then R = Re, © -:: © Re,,. After reindexing if neces- 
sary, we may write R= Re, © -:: © Re,, a direct sum of minimal left 
ideals. 

In the family of minimal left ideals Re, ,...,Re,, choose a largest sub- 
family consisting of all minimal left ideals that are not isomorphic to each 
other as left R-modules. After renumbering if necessary, let this subfamily 
be Re,,...,Re,. 

Suppose the number of left ideals in the family (Re,), 1 s is n, that are 
isomorphic to Re, is n,. Then 


n,summands 7”, summands n, summands 
a — 
R= [Re,® ---] © [Re,@ --] © «++ © Re, ® +], 


where each set of brackets contains pairwise isomorphic minimal left 
ideals, and no minimal left ideal in any pair of brackets is isomorphic to a 
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minimal left ideal in another pair. By observing that Hom g(Re,, Re,) = 0, 
i j,1 s i,j = k, and recalling Schur’s lemma (Theorem 4.2, Chapter 14) 
that Hom ,(Re,,Re,) = D,, a division ring, we get, by invoking Theorem 
1.1, 


n, Xn, block 
D,; «++ D, 


D, «ee D, 


Ny X No block 
D, eee D, 


Home (R, R) = 


Dy *** D, 


Dye? D, 


(D ! Ym, 


3 


(Dy)a, 
= (Dyn, O (Daa, O *** O (Dy), - 


But since Hom ,(R,R) = R® as rings [Example 3.5(d), Chapter 14], and 
the opposite ring ofa division ring is a division ring, R is a finite direct sum 
of matrix rings over division nngs. O 


Because the matrix rings over division rings [Example 2.4(d)] are both 
right and left noetherian and artinian, and a finite direct sum of noether- 
ian and artinian rings is again noetherian and artinian (prove these), we 
get 
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3.3 Theorem. Let R be a left (or right) artinian ring with unity and no 
nonzero nilpotent ideals. Then R is also right and left artinian (noether- 


ian). 


Not every ring that is artinian on one side is artinian on the other side. 
The following example gives a ring that is left artinian but not night 
artinian. 


3.4 Example 


Let R=(2 9). If A=[$ 9], then A is an ideal of R, and as a left R- 
module it is simple. Thus, A is left artinian. Also, the quotient ring R/A = 
[2 9], which is a field. Therefore, R/A as an R/A-module (or as an R- 
module) is artinian. Because A is an artinian left R-module, we obtain that 
Rasa left R-module is artinian (Theorem 2.6); that is, 2 is a left artinian 
ring, but R is not right artinian. For there exists a strictly descending chain 


0 22 0 2Z 0 232 
[oo }>[0 o|>[0 0 ]>~ 
of right ideals of R. 


As an application of the Wedderburn - Artin theorem, we prove the 
following useful result for a certain class of group algebras. 


3.5 Theorem (Maschke). [fF is the field of complex numbers andG is 
a finite group, then 


F(G) = F,,@ ++: @ Fy, 


for some positive integers n,,...,Ng. 


Proof. We first prove that F(G) has no nonzero nilpotent ideals. Let 
G = (g, = e, 29,..., &,), and x = La,g, € F(G). Set x* = a,g;', where 
a, denotes the complex conjugate of a,. Then 


= S'la,|? + 
XX 2 il 2,F:8i 
for some f, € F. Hence, xx* = 0 implies 27. ,|a,|? = 0, so each a, = 0; 
that is, x = 0. Thus, xx* = 0 implies x = 0. Let A be a nilpotent ideal in 
F(G). Let a € A. Then aa* € A, so aa* is nilpotent, say (aa*)’ = 0. (We 
may assume r is even.) Set b = (aa*)’?. Then b? = 0 and b = b*. Thus, 
bb* = 0, which gives (aa*)"? = b = 0. Proceeding like this, we get aa* = 
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0. Hence, a = 0, which proves that A = (0). Hence, F(G) has no nonzero 
nilpotent ideals. 

Further, F(G) is a finite-dimensional algebra with unity over the field 
F. Therefore, F(G) is an artinian ring. Then by the Wedderburn - Artin 


theorem, 
F(G) = DY ® +++ © D®, 

where D, 1 = is k, are division rings. Now each D®) contains a copy K 
of F in its center. In this way each D() is a finite-dimensional algebra over 
K (How?). Let [D:K] =n, and a € D™. Then 1,a,qa?,...,a” are linearly 
dependent over K. Thus, there exist @,@,,...,.@, (not all zero) in K such 
that a) +a,a+ --- +a,a"=0. But since K is algebraically closed, 
A +a,x+ -*: +a,x" E K[x] has all its roots in K. Hence, a€ K, 
which shows that D“ = K = F and completes the proof. O 


The next theorem gives some equivalent statements for left artinian 
rings that have no nonzero nilpotent ideals. 


3.6 Theorem. Let R be a ring with unity. Then the following are 
equivalent: 


(i) Ris left artinian with no nonzero nilpotent ideals. 
(ii) Ris left artinian with no nonzero nil ideals. 
(iti) Risa finite direct sum of minimal left ideals. 
(iv) Each left ideal of R is of the form Re, e an idempotent. 
(v) Risa finite direct sum of matrix rings over division rings. 


(A ring satisfying any of these conditions is called a semisimple artinian 
ring.) 


Proof. (i) = (ii) Follows from the fact that nil ideals in an artinian ring are 
nilpotent. 

(1) = (iii) Follows from the proof given for the Wedderburn - Artin 
theorem. 

(iii) => (iv) Follows from Problems | or 4 of Section 4 in Chapter 14. 

(iv)=>(v) Let M = Re be a maximal left ideal of R with e = e?. Then 


se 4 . R ; 
R(1 —e) is a minimal left ideal because Re R(1 — e) is a simple R- 


module. Thus if S is the sum of all minimal left ideals of R then S 4 (0). 
We claim S = R. Otherwise, S is contained in a maximal left ideal Rf, 
say, where f = f7. But then R(1 — f) is a minimal left ideal which is 
not contained in S because SOA R(1 — f) CRS AR(i — f) =(0), a con- 
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tradiction. Thus S = R. Then, as shown in the latter part of Theorem 
3.2, R is a finite-direct-sum of matrix rings over division rings. 

(v) => (1) Because the matrix rings over division rings are right and left 
artinian, and the finite direct sum of left (or right) artinian rings is again 
left (or right) artinian, we get R is left and right artinian. Further, the 
matrix rings over division rings are simple rings with unity and, hence, 
cannot possess nonzero nilpotent ideals. Now if R= A, © ::: © A, 18a 
finite direct sum of simple rings each with unity, then it is easy to verify 
that any nonzero ideal J is of the form A, © -:- © A,, where (i,,...,4,} C 
(1,...,4}; hence, J cannot be nilpotent. O 


3.7 Remarks on uniqueness 


Exactly as in groups, we define the composition series of a module M as a 
sequence (M),M,,...,.!,) such that (0)=A4,C M,C -:: CM,=M, 
where each quotient module M,/M,_,, i= 1,...,k, is simple. Then the 
Jordan-Hélder theorem holds for modules possessing composition 
series. The proof carries over verbatim to modules. 


3.8 Theorem. Let D, and D/, ben X nandm X m matrix rings over 
division rings D and D’, respectively, such that D, = D‘, Thenn = mand 
D= D’. 


Proof. Let (e,;), 1 Si,jsn, and (e;,), 1 = k,/ = _m, denote the matrix 
units of the rings D, and Dj, respectively. Then 


° 


D,=@®) Dex and Di=@®Y Diey. 
1 k= 


Because D,e,, and D‘,e,, are minimal left ideals of D, and D/,, respec- 
tively, we get two composition series 


(0) C D,e,, C D,e;, © Dex C + CD, 
and 
(0) C D’,e, C D’,e’, ® Di,e4n€ «** C Di, 


of D, and D/, regarded as modules over themselves. Because D, = Dj,, it 
follows by the Jordan—Hélder theorem that n = m and D, e;,; = D},e;;- 
This implies that Homp(D,e;;,D,e;;) = Homp,(D;,e;;,D,e);). But 
Hom p (D,€;;,D,e;;) = D. (Prove this.) Hence, D= D’, O 
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The uniqueness of the decomposition of any artinian ring with no 
nonzero nilpotent ideals can similarly be proved. The interested reader is 
referred to any advanced text for studying the uniqueness of the direct 
sum decomposition of a module as given by Krull, Schmidt, Remak, 
Azumaya, and others. 


Problems 


1. Let R bea left artinian ring with unity and no nonzero nilpotent 
ideals. Then show that for each ideal J of R, R//is also left artinian 
with no nonzero nilpotent ideals. 

2. Let R be a prime left artinian ring with unity. Show that R is 
isomorphic to the 7 X matrix ring over a division ring. Hence, 
show that a prime ideal in an artinian ring is maximal. 

3. Let R be an artinian ring. Then show that the following sets are 
ideals and are equal: 

N =sum of nil ideals, 

U =sum of all nilpotent left ideals, 

V = sum of all nilpotent right ideals. 

Also show that R/N has no nonzero nil ideals. 

4. LetR = Z[x, y]//, where] = (x? — 1,xy — y, y? — 2y). Show that 
(a) Rhas no nonzero nilpotent elements. 

(b) Ris not the direct sum of minimal ideals. 

5. Let Rbea finite-dimensional algebra over an algebraically closed 
field F. Suppose R has no nonzero nil ideals. Show that R is 
isomorphic to the direct sum of matrix rings over F. 


4 Uniform modules, primary modules, and Noether—Lasker 
theorem 


Definition. A nonzero module M is called uniform if any two nonzero 
submodules of M have nonzero intersection. 


Definition. If U and V are uniform modules, we say U is subisomorphic to 
V and write U ~ V provided U and V contain nonzero isomorphic sub- 
modules. 


Note ~ 1s an equivalence relation and [U] denotes the equivalence 
class of U. 


Definition. A module M is called primary if each nonzero submodule of 
M has uniform submodule and any two uniform submodules of M are 
subisomorphic. 
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We note that Z as a Z-module is uniform and primary. Indeed, any 
uniform module must be primary. Another example of a uniform 
module is a commutative integral domain regarded as a module over 


itself. 


4.1 Theorem. Let M be a noetherian module or any module over a 
noetherian ring. Then each nonzero submodule contains a uniform module. 


Proof. Let 0 # xeM. It is enough to show xR contains a uniform sub- 
module. If M is noetherian, the submodule xR is also noetherian. But if 
R is noetherian, then xR, the homomorphic image of R, is noetherian. 

For convenience, call, as usual, a nonzero submodule N of M large if 
NoakK £0 for all nonzero submodules K of M. 

Consider now the family F of all submodules of xR which are not 
large. Clearly Oe F. Since xR is noetherian, F has a maximal member 
K, say. Because K is not large, K 7 U = 0 for some nonzero submodule 
U of xR. We claim U is uniform. Otherwise, there exist submodules 
A,B of U such that An B = (0). But then (K @ A)rq B= (0). For if xe 
(K ® A)OB, then x =k+a=b for some keK, aeA, beB. This gives 
k=b—aeKQU =(0). Hence k = 0 and b — a = 0, which further yields 
b=ae€ANB=(0). Thus b=0=a, proving x = 0; that is, (K @ A)AB 
= (0). However, this yields a contradiction to the maximality of K. 
This shows U is uniform, completing the proof. 0 


Definition. If R is a commutative noetherian ring and P is a prime ideal of 
R, then P is said to be associated with the module M if R/P embeds in M, 
or equivalently P=r(x) for some xeM, where r(x) = {aeR|xa=0} 
denotes the annihilator of x. 


Definition. A module M is called P-primary for some prime ideal P if P is 
the only prime ideal associated with M. 


4.2 Remark. If R is a commutative noetherian ring and P is a prime 
ideal of R, then an R-module 1s P-primary if and only if each nonzero 
submodule of M is subisomorphic to R/P. 


4.3 Theorem. Let U be a uniform module over a commutative noe- 
therian ring R. Then U contains a submodule isomorphic to R/P for 
precisely one prime ideal P, that is, U is subisomorphic to R/P for exactly 
one prime ideal P. 
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Proof. Consider the family F of annithilator ideals r(x), 0 # xeU. Since 
R is noetherian, there exists a maximal member, say r(x) in F. We 
claim P = r(x) is prime. For, let aber(x), agr{x). Then xab =0. Now 
xa # Oand r(xa) 2 r(x). By maximality of r(x), r(xa) = r(x). Thus xab = 0 
implies xb =0 and so ber(x). This proves P is prime. Clearly xR = 
R/r(x)= R/P. Therefore, R/P is embeddable in U. If for any other prime 
ideal Q, R/Q is embeddable in U, then [R/P] =[R/Q]. This implies 
there exist cyclic submodules xR and yR of R/P and R/Q, respectively, 
such that xR~ yR. But xR~R/P and yR~R/Q. So R/P=~R/Q, which 
yields P = Q, proving the theorem. (2 


4.4 Remark. The ideal P in the above theorem is usually called the 
prime ideal associated with uniform module U. 


4.5 Theorem. Let M be a nonzero finitely generated module over a 
commutative noetherian ring R. Then there are only a finite number 
of primes associated with M. 


Proof. Consider the family F consisting of the direct sums of cyclic 
uniform submodules of M. F is not empty, as proved in Theorem 4.1. 
We partial order F by ® DigsXiR S OD jes y;K if and only if / CJ and 
x;RC y,R for iel. By Zorn’s Lemma, F has a maximal member 
K = ®3j-,X,;R, say. Since M is noetherian, K is finitely generated 
and let K = @ 5°). ,x,R. There exist x;ajéx,R such that r(x,a;) = P;, 
the prime ideal associated with x,R. Set x = x,a,and K’ = ®)7),=,X)R. 
Let Q = r(x) be an associated prime ideal of M. We shall show Q = P, 
for some j, | <j<l. 

Since K is a maximal member of F, K as well as K’ have the property 
that each intersects nontrivially with any nonzero submodule L of 
M. Now let 04 yexROK’. Write y=5.,xj;b;=xb. We claim 
r(xjb;) = r(x;) whenever xb, #0. Clearly r(x) C r(xjb,). Let xjbjc =0. 
This implies bjcer(x;) = P; and so ceP, since bj¢ P;. Hence, cer(x;). 

Furthermore, we note Q=r(x)=r(y)=()j-17(xjb)) =()jea Ph 
omitting those arising from xjb;=0, where AC {1,...,/}. Therefore, 
Qc P, for all jeA. Also, [jen Pj) C Ojen Pj = Q. Since Q is a prime ideal, 
at least one P,; appearing in the product [],.<,P; must be contained 
in Q. Hence, Q = P; for some j. OD 


We conclude with Noether—Lasker theorem whose proof is a con- 
sequence of the above results (see Remarks 4.7). 


4.6 Theorem (Noether—Lasker). Let M be a finitely generated 
module over a commutative noetherian ring R. Then there exists a finite 
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family N,,...,N, of submodules of M such that 
(a) ()\j2,Ni=0 and ()j.,N, #0 forall 1<ig<l. 
(b) Each quotient M/N, re a P,-primary module for some prime ideal 


P,. 

(c) The P, are all distinct, | <i<l. 

(d) The primary component N, is unique if and only if P, does not 
contain P, for any j #i. 


47 Remarks. Let {U,}, 1 < i <1 be uniform submodules obtained 
as in the proof of Theorem 4.5. Choose N; to be a maximal member 
in the family {K C M|K contains no submodule subisomorphic to U;}. 
With this choice of N,,...,N,, (a), (b), and (c) follow directly. 

This form of the Noether—Lasker theorem and the results leading to 
the theorem are taken from Barbara Osofsky, Noether—Lasker Primary 
Decomposition Revisited, preprint (1994), to appear in American 
Mathematical Monthly. The reader may refer to this article for addi- 
tional details and to see how the statement given in Theorem 4.6 is 
indeed equivalent to the usual classical statement. 


4.8 Exercise (Osofsky). Let R= R[X, Y, Z] be the ring of poly- 
nomials over the reals. Let J be the ideal of R generated by {X?, XY, 
XZ}. Find the primes associated to the module M = R/I and a set 


of primary components. 


CHAPTER 20 


Smith normal form over a PID and rank 


| Preliminaries 


Throughout this chapter R is a principal ideal domain with unity. Thus, in 
particular, R may bea field. Recall that the rank ofa finitely generated free 
R-module is defined to be the number of elements in any basis of the free 
module (Chapter 14, Section 5). For an m X n matrix A with entnesin R 
we define the column rank and the row rank of A in terms of the rank ofa 
certain free R-module, and we show in Section 3 that these two ranks are 
equal - the common value being known as the rank of A. But first we need 


a key lemma. 


1.1 Lemma. Let R be a principal ideal domain, and let F be a free 
R-module with a basis consisting of n elements. Then any submodule K of 
F is also free with a basis consisting of m elements, such that ms n. 


Proof. We know F = R" as R-modules (Theorem 5.1, Chapter 14). To 
prove the theorem, we use induction on n. R® is interpreted as a (0) 
module, and this 1s free on the empty set. Therefore, we may assume that 
n> 0, and let us identify the copy of K in R” (under the isomorphism 
F = R") with K itself. Let 2: K —> R be the mapping defined by 


MX) 5.05 X_) BX. 
If x = 0, then K = Ker 7 C R"~', and the theorem follows by induction 
on n. If 2 #0, its image is a nonzero ideal Ra in R; that 1s, 7(K) = Ra, 
where a # 0. Choose k € K such that 2(k) = a. We assert 

K = Rk © Ker 2. 
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For let x € K. Write 2(x) = ba, bE R. Then x(x — bk) = ba — ba =0. 
Hence, 


x = bk + (x — bk) 


implies that 

x € Rk + Ker z. 
Thus, 

K = Rk + Ker 2. 


To prove that the sum is direct, let ck € Rk NM Ker 2, cE R. Then 0 = 
(ck) = ca; hence, c = 0 because a ¥ 0. This proves our assertion that 
K = Rk © Ker z. 

It is easy to check that the mapping r- rk of R onto Rk is an R- 
isomorphism. 

Further, Ker 2 = ((0, x,...,X, |x, € R). Thus, Ker 7 G R"~'. Hence, 
by induction, Ker 2 is free, with a basis consisting of at most n— | 
generators. Therefore, 


K = Kerz© Rk= R" OR, msn-l, 


K= R™*!, m+isn. O 


2 Row module, column module, and rank 


Let A be an m Xn matrix over R. The rows (columns) of A are the 
elements of the R-module R'*" (R”*') consisting of the 1 X 2 (m X 1) 
matrices over R. Generally, we write R'*" (and R™*') as R" (and R”). 
Using the notation R” to denote rows as well as columns never creates any 
confusion because context always makes the meaning clear. 


Definition. Let A be an m Xn matrix over R. The submodule of R" 
generated by the m rows of A is called the row module of A; and the 
submodule of R™ generated by the n columns of A is called the column 
module of A. 


R(A) and C(A), respectively, denote the row module and the column 
module of the matrix A. If the ring R is a field, R(A) and C(A) are, 
respectively, called the row space and column space of matrix A. 

Note that R(A) and C(A) are finitely generated submodules of free 
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modules R” and R”, respectively. Thus by Lemma |!.1, both R(A) and 
C(A) are free modules. 


Definition. Let A be an m X n matrix over R. The rank of the module R(A) 
[C(A)] is called the row rank (column rank) of A. 


The proof of the following theorem is exactly the same as the proof of 
the corresponding theorem for matrices over a field (Lemma 7.5, Section 
7, Chapter 14). 


2.1 Theorem. Let A be an m X n matrix over R. Let P and Q, respec- 
tively, bem X mand n X n invertible matrices over R. Then row (column) 
rank (PAQ) = row (column) rank (A). 


In order to compute the row rank or the column rank ofa given matrix, 
we reduce the matrix into “normal” form by performing certain opera- 
tions on rows and columns defined in the next section. Eventually, we 
obtain by this process the result that the row rank of a matrix A is equal to 
the column rank of the matrix (Theorem 3.3). 


3 Smith normal form 
We begin with a 


Definition. Let A be an m X n matrix over R. The following three types of 
operations on the rows (columns) of A are called elementary row (column) 
operations. 


(i) Interchanging two rows (columns). We denote by R,«* R;(C,; 
C;) the operation of interchanging the ith and jth rows columns) 
(ii) Multiplying the elements of one row (column) by a nonzero ele- 
ment of R. We denote by aR, (aC,) the operation of multiplying 

the ith row (column) by a. 

(iti) Adding to the elements of one row (column) a times the corre- 
sponding elements of a different row (column), wherea Ee R. We 
denote by R,+ aR; (C,+ aC,) the operation of adding to the 
elements of the ith row (column) a times the corresponding ele- 
ments of the jth row (column). 


Recall that ey, | Sijsn, denote n Xn matrix units, and eye, = 
d,€,, where d, is the Kronecker delta. 
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3.1 Theorem. Let A be an m Xn matrix over R. 


(i) I Ey=\—e,—e,+e, +e, then EA (AE,) is the matrix 
obtained from A by interchanging the ith and the jth rows (col- 
umns). Also, Ey'!= Ey. 

(ii) If L(a)=1+(a— le, and a is an invertible element in R, 
then L,(a)A [AL,(q)] is the matrix obtained from A by multiply- 
ing the ith row (column) by a. Also, L7'(a) = 1 + (a7! — I)e,,. 
(iii) IfM,(a) = 1+ aey, then M,(a)A [AM;,(@)] is the matrix ob- 
tained from A by multiplying the jth row (column) by a and 
adding it to the ith row (column). Also, Mj‘ (a) = 1 — aey 


(i # j). 
Proof. (i) E,;A =(1 — @;; =€yF e+ € ,)A, or 


We note that 


(a) e,,A is the matrix whose ith row is the same as that of A, and all 


other rows are zero. 
(b) eA is the matrix whose ith row is the jth row of A, and all other 


rows are zero. 
Thus, (1) shows that E,,A is the matrix obtained from A by interchanging 
its ith and jth rows. Because Ej = 1, it follows that E7'= E,,. 
(11) L(a)A — A sy. eA + ae, A. Because 
e,A = lan ° a. | ith row, 
L,(a)A is the matrix obtained from A by multiplying the ith row by a. 
Further, by actual multiplication 
(1 + (a > le, Xl + (aq! — 1)e,,) =], 
This proves L7'(a) = (1 + (a7! — I))e,,. 
(111) M,,(a@)A = A + ae, A. Because 
eyA = la, a, [th row, 
M ,(a)A is the matrix obtained from A by multiplying the jth row of A by 
qa and adding it to the ith row. Finally, 
(1 + ae,X1 — aey) = 1 — ae, + ae, = I, 


so M;' (a)=1—ae,. O 
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Definition. The matrices E,,, L,(a), and M,,(a) are known as elementary 
matrices. 


We note that an elementary matrix is the result of performing a single 
elementary row or column operation on an identity matrix. Precisely, 


E,,; = the matrix obtained from the identity matrix by inter- 
changing the ith and jth rows (equivalently, inter- 
changing the ith and jth columns). 

L;(«) = the matrix obtained from the identity matrix by multi- 
plying the ith row by a nonzero ae R (equivalently, by 
multiplying the ith column by a). 

M,,(a) = the matrix obtained from the identity matrix by adding 
to the elements of the ith row « times the corresponding 
elements of the jth row, where ce R [M,,(q) is also the 
matrix obtained from the identity matrix by adding to 
the elements of the jth column « times the corresponding 
elements of the ith column]. 


In addition to these three elementary row (column) operations, we 
apply a nonelementary operation to the rows and columns of A: multipli- 
cation by matrices of the form 


1 


Be 
(iv) v ft ’ 


where [¥ *] is invertible in R,, the 2 x 2 matrix ring over R. 

Multiplying A on the right (left) by a suitable matrix of the above form 
has the effect of replacing two of the entries on a given row (column) by 
their greatest common divisor and 0, respectively. 
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Definition. Two m X n matrices A and B over R are said to be equivalent if 
there exists an invertible matrix P € R,, and an invertible matrix QE R,, 
such that B = PAQ. 


It is clear that “being equivalent” defines an equivalence relation in the 
set of m Xn matrices with entries in R. 

Weare now ready to state and prove that every matrix is equivalent toa 
“diagonal” matrix, known as the Smith normal form. 


3.2 Theorem. JfA is an m X n matrix over a principal ideal domain 
R, then A is equivalent to a matrix that has the “diagonal” form 


a, 
Qa> 


where the a, # 0 and a,\a,| °°: |a,. 


Proof. We define the length /(a) of a ¥ 0 to be the number of prime factors 
occurring in the factorization, a = p; Pp. *** P,, p; primes (not necessarily 
distinct). We use the convention that ((u) = 0 if u is a unit. 

If A = 0, then there is nothing to prove. Otherwise, let a, be a nonzero 
element of A with minimal length /(a,,). Elementary row and column 
operations bring this element to the (1,1) position. We may then assume 
that the nonzero element of A with smallest length is at the (1,1) position. 
Let a,, / a,,. Interchanging the second and the kth columns, we may 
assume a, / 2,2. Let d = (a,, ,@,2) be the greatest common divisor of a,, 
and a,,. Then /[(d) < ((a,,). There exist elements u,v € R such that 
a,,u+a,,v=d. Because d=(a,,,@,2), there exist s,¢€ R such that 
a,, = ds and a,,= dt. Thus, d(us + vt) = d; so us + vt = 1. It can be 
verified that 
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which implies [¥ _{] is invertible. Multiplying A on the right by 


Us 


b a : 
v —S 
l 


we obtain the matrix whose first row is 


(d 0 bi ‘se by ,), 


where /(d) < /(a;,). 

Ifa,,|a,2, then by performing elementary row (column) operation (iii) 
on the first two columns (or, equivalently, by multiplying on the right by a 
suitable elementary matrix), we can reduce A to a matrix whose first row is 
again of the form (d 0 5,; --- b,,), where I(d) < I(a,;). 

Continuing this process yields an equivalent matrix whose first row has 
all entries 0, except the (1,1) entry. 

Similarly, appropriate elementary row operations (i)-(iii) and the 
nonelementary operation of multiplying on the left by the matrix of the 
form given in (iv) reduce the elements in the first column after the (1,1) 
position to 0 and either keep the elements in the first row unaltered (i.e., 
all apart from (1,1) entry are zero) or reduce the length of the (1,1) entry. 
In the second case we repeat the process by which all the elements in the 
first row except the one at the (1,1) position are reduced to 0. Because 
I(a,,) is finite, this process (of alternately reducing the first row and the 
first column) must come to an end. When it does, we have reduced A to 


the form 
a, 0 0 ::: Q 


P,AQ, =| ii 
0 
where A, is an (m1 — 1) X (nm — 1) matnx, and P, and Q, are m X mand 


n X n invertible matrices, respectively. 
Similarly, there exist invertible matrices P3 and Q% of orders 
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(m— 1) X(m-— 1) and (n— 1) X (n— 1), respectively, such that 


a, 0 - 0 
0 


P3A,Q;=| . Aa 
0 
where A, is an (mm — 2) X (nm — 2) matrix. Let 
| O 1 0O 
P, E - and Q, 5 a 


be, respectively, m X mand n X n invertible matrices. Then 


a, 0 - 0 
0 a: 0 

P,P,AQ,Q)™ A, 
0 O 


Continuing like this (or by induction on m + n), we obtain 
PAQ = diag(a, ,a3,...,4,,0,...,0). 
Finally, we show that we can reduce PAQ further such that 


a,|a2| °°: |a,. Assume a, J a,. Add the second row to the first row. The 
first row then becomes 

(a, a, O ;-- OQ). 
By performing these operations, we can reduce the length of a, . Thus, by 
further reduction we may assume a, |a, and, similarly, a,|a,, i = 3,...,r. By 
repeating this procedure for a, in place of a, , and so on, we finally reacha 
situation where @,|a,,,,/= 1,..,7r—1. O 


Definition. The nonzero diagonal elements of the matrix having the diago- 
nal form given in Theorem 3.2 are called the invariant factors of A. 


It can be shown that the invariant factors are unique up to unit multi- 
pliers, and two m X n matrices are equivalent if and only if they have the 
same invariant factors. We do not intend to discuss these questions. 
The interested reader may refer to any graduate level text in abstract 
algebra. 
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3.3 Theorem. Let A be an m X n matrix over a PID. Then row rank 
A = column rank A. 


Proof. Choose P and Q invertible matrices of suitable sizes such that PAQ 
is in Smith normal form. Then by Theorem 2.4 


row rank A = row rank PAQ = r= column rank PAQ 
=column rank A. QO 


Definition. Let A bean m X n matrix overa PID. The common value of the 
row rank of A and the column rank of A is called the rank of A. 


3.4 Examples 


(a) Obtain the Smith normal form and rank for the matrix with integral 
entries 


1 2 3 
4 5 O}° 
Solution: 


12 3]%-@fp 2 3] GrmMu-3a fy 9 
450 0 -3 -12 0 -3 -12 


S-ii} 0 OFOR:11 0 0 
0 -3 0 0 3 O07 
Clearly, rank A = 2. 


Note. Sometimes one may end up with a “diagonal” matrix 
diag(d, ,d.,...), where d, does not necessarily divide d,,, for some i. In 
such a situation One must continue performing suitable operations until 
alriving at the desired form. 


(b) Find the invariant factors of the matrix 


—x 4 2 
-3 8-x 3 
4 -8 -2-x 


over the ring Q[x]. Also find the rank. 
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Solution. The matrix can be proved to be equivalent to 


1 0 0 

0 | 0 

0 0 (x-—2Xx? — 4x + 20) 
[we performed (i) R, <> Rs, (ii) R3 + (x/4)Ry, (iii) Ro + ZR), (iv) R3 — 
2R2, (v) Cy +2C,, (vi) C3 —9Cy, (vii) 4C,, (vii) Ch + (24+ x)C,, 
(ix) —4C,;, (x) R3 +R, (xi) Cy +(2 —xX)C,, (xi) & Rs, (xii) Co + 
[(2 — x)/16JC3, (xiv) R,~+R, and C,++C, in succession]. It will be a 
good exercise for the reader to obtain this form in fewer steps. Clearly, 
rank A = 3. 


Problems 


1. Obtain the Smith normal form and rank for the following matri- 
ces over a principal ideal domain R: 


0 2-1 
(a) {| -3 8 3 |, where R = Z. 
2-4 -!1 
x= 3 2 0 
(b) 1 —X l , Where R= Q[x]. 
1 =—3 =x 2 


2. Find the invariant factors of the following matrix over Q[x]: 


5—-x | —-2 4 
0 =x 2 2 
0 0 5-x 3 
0 0 0 4 


3. Find the rank of the subgroup of Z* generated by each of the 
following lists of elements: 
(a) (3,6,9,0), (—4,— 8,— 12,0). 
(b) (2,3,1,4), (1,2,3,0), (1,1,1,4). 
(c) (= I »2,0,0), i 3, I :0), ( l, 1 9 1 9 1 ). 


CHAPTER 21 


Finitely generated modules over a PID 


Our object in this chapter is to prove the fundamental structure theorem 
for finitely generated modules over a principal ideal domain. This 
theorem provides a decomposition of such a module into a direct sum of 
cyclic modules. In particular, it gives an independent proof of the funda- 
mental theorem of finitely generated abelian groups proved earlier in 
Chapter 8. Applications of this theorem to linear algebra are given in 
Sections 4 and 5. 


1 Decomposition theorem 


1.1 Theorem. Let R be a principal ideal domain, and let M be any 
finitely generated R-module. Then 


M = R*@ R/Ra,@ -:: © R/Ra,, 


a direct sum of cyclic modules, where the a, are nonzero nonunits and 
Q,|Qj41, j= 1,....%= 1. 


Proof. Because M is a finitely generated R-module, M = R"/K [Example 
5.5(a), Chapter 14]. Further, by Lemma 1.1 of Chapter 20, K = R", ms 
n. Let ¢ be this isomorphism from R” to K. Thus, K = d({R”). Let 
(€, 5-.-9€m) De a basis of R™. Let us write 
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de.) = € R" 


Hen =|  |ER 


Then ¢(R”) = AR”, where A = (a,,) is an m X m matrix. 

Choose invertible matrices P and Q of order n X n and m X m, respec- 
tively, such that PAQ = diag(a,,a,,...,a,,0,...,0), where a, |a2|---]a,. 
Then 


uw RR RY PR R’ 
K @&R") AR" PAQR™~ PAQR™ 
a, 

a, 0 
R 
; R 
] R"/ a, * 
0 ° 
: R 

0 
m R@R @®-:--@R 
Ra, @ Ra, Be nr @® Ra, 


R R 
Ra, Ra. 


mw 
=— 
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n— k copies 


R 
OR, OR - OR (by Theorem 3.4, Chapter 14) 
k 


R 
Ra, 


[by deleting the zero terms if any 
(corresponding to those a,’s that are units)] 


R 8 
© + RG 


R R ‘ eich 
ka ® aie ORG OR (by renumbering if necessary). 


Because for any ideal / [including (0)], R// is a cyclic R-module, the proof 
is completed. O 


2 Uniqueness of the decomposition 
First, we give a 


Definition. An element x of an R-module M is called a torsion element if 
there exists a nonzero element rE R such that rx = 0. 


Definition. A nonzero element x of an R-module M is called a torsion-free 
element if rx = 0, reR, implies r =0. 


2.1 Theorem. Let R be a principal ideal domain, and let M be an 
R-module. Then 


Tor M = (x € M|<x is torsion) 
is a submodule of M. 


Proof. Left to the reader as an exercise. O 


2.2 Theorem. Let M be a finitely generated module over a principal 
ideal domain R. Then 


M=FO@® Tor M, 
where (i) F = R* for some nonnegative integer s, and (ii) Tor M= 
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R/Ra, @ «+: © R/Ra,, where a, are nonzero nonunit elements in R such 
that a,|a2| °° |a,. 


Proof. By the structure theorem for finitely generated modules over a PID, 
M= R'@ R/Ra, © -*: © R/Ra,, where a, are nonzero nonunit ele- 
ments in R such that a,|a,| °°: |a,. It then follows that 


M=FOQT, 


where F = R’, and T= R/Ra, © ::- © R/Ra,. Clearly, a,7 = 0. Thus, 
TC Tor M. 

Next, let x € Tor M. Then x= x, + x2, x, EF, x,E T. So x, = 
xX — X, € Tor M, and, hence, rx, = 0 for some 0 # rE R. If (y,,...,¥,) © 
R‘ is the image of x, under the isomorphism F = R’, then r(y,,..., ,) = 9. 
Hence, ry,=0, 1s iss. Therefore, each y,= 0, because r #0. This 
yields x, = 0 and proves that x € 7, and, thus, 7 = Tor M. This com- 
pletes the proof. O 


We are now ready to prove the uniqueness of a decomposition of a 
finitely generated module over a principal ideal domain. 


2.3 Theorem. Let R be a principal ideal domain, and let M be a 
finitely generated R-module. Suppose 


M = R?@ R/Ra, © --- © R/Ra,, (1) 
where a, are nonzero nonunit elements in R such that a,|\a,| °° |a,,. 
M = R'@ R/Rb, © -** © R/RO,, (2) 


where a; are nonzero nonunit elements in R such that a,|a,|---|a,, and 
Thens=t,u=v, and Ra, Rb,,isisu. 


Proof. By Theorem 2.2, (1) and (2), respectively, give M = F © Tor M, 
and M = F’ © Tor M, where F = R’, F’ = R'. But then 


M _FOTorM _ 
Tor M Tor M 


Thus, F = F’. This yields s = ¢ (Theorem 5.4, Chapter 14). 
We now proceed to show that if 


T= Tor M= R/Ra, © -:*+ © R/Ra, (3) 


F. 


and 
T= Tor M= R/Rb, © -*- © R/RO,, (4) 
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where the a,’s and 5,’s satisfy conditions as in (1) and (2), respectively, then 
u=vand Ra, = Rb,. For any R-module X and a prime p € R, we define 


X, = {x & X|px = 0). 

Then from (3), 
“ {R 

T,~® (#). 

: p> Ra,} , 
Clearly, 

(2) - AUP) if pla, 

a | 

Rai’> | @)  ifpra, 


and R(a,/p)/Ra, is a one-dimensional vector space over R/Rp generated 
by the element a,/p + Ra,. 

Therefore, 7, is a vector space over R/Rp, and its dimension is equal to 
the number of terms R/Ra, such that pja,. 

Suppose that pis a prime dividing a, , and hence a,, foreach i = 1,...,u. 
Let 7 have a direct sum decomposition into vysummands as in (2). Then p 
must divide at least u of the elements b,, whence u Ss v. By symmetry, 
u = v. Therefore, any two decompositions of types (1) and (2) of any given 
module must have an equal number of summands. This fact plays an 
important role in proving the rest of the theorem. 

From (3) we have a,7 = 0. But then (4) gives Ra, C Rb,. By sym- 
metry, Rb, C Ra,. Hence, Ra, = Rb,. 

Now assume (a,) = (b,), KS is u. We show (a,_,) = (b,_,). This is 
done by contradiction, obtained by multiplying (3) and (4) by a suitable 
element in R to make the first summands zero in one of the decompo- 
sitions (3) or (4), and the first k — 1 summands zero in the other decom- 
position, for some m < k — 1. 

Let p be a prime in R such that p*ja,_,, p**'/a,_, and p*|b,_,, 
p?t' b,_,. Ifa = B for each prime p, then we are fine because this means 
(a,-,) = (b,_,). Otherwise, for definiteness, let a > £. Put 


x= _— (~~) ER, where p®|a,, p°*'Ya,. 
Then (3) gives 


R R Rx + Ra, Rx + Ra, 
xT | Rar” 1) a Ra, © ---@ Ra.” (5) 


and its (k — 1)th summand is 
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Rx + Ra,_, — Rp*7'(a,/p*) + Rp*aj-, , 
pea Seek SL a SR. is am | 
Ra,.. Rp*ai.. (where (a;-1,p) ™ !) 
_ __Rd 
Rp*aj_,’ 
where d = (p*~'(a,,/p®), p%a;_ ,), the g.c.d. of p*~ '(a,,/p®) and p%a;_,. But 
, Qp-1 | 2y 
=> EB ae, | —— 
Q,-,\a, Qy- p? p? 


a, s = s 
= (o-- 7 pai... = p*'ai_,. 
Therefore, d = p®~'a,_,. Thus, the (kK — 1)th summand in (5) is 
Rp*"'a;,_ is R 
Rp*a,-, Rp’ 

because in any integral domain Ra/Rab = R/Rb, 0 #a,b€ R. 

It can similarly be shown that any summand preceding the (k — 1)th 
summand is either R/(p) or (0); and indeed if any summand is (0) then all 


the preceding ones are also zero. 
Therefore, (5) can be rewritten as 


5 terms t terms 


oe . (6) 
xT = (0) ++ ©(0)O R/M(p)@ ++ © Rp) © R/(p) 


© R/(p*)@ ++: © Ri(p*), 
where s,f = 0, and s + ¢ = k — 2. Further, from (4), 


Rx + Rb, Rx + Rb, 
xT Rp, © ©) Rb, ’ (7) 


and its (k — 1)th summand is 


FOE RS RO OSES RO crepe, py eit) 


Rb,-1 Rp*bj..., 
= Rp*bi_, ( . 
Rp*bi_, since a > B, and b,_,|6, 
by -; b, 
=> bi. = —— | —. 
k-1 p? p? 


=> p*by_,|po—' *). 
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Therefore, ((Rx + Rb,_,)/Rb,_,) = (0). Thus, RxC Rb,_,. Because 
Rb,_, C Rb,-2 C **+ C Rb,, it follows that the first k— 1 summandsin (7) 
are all zero. Therefore, the decomposition (7) of xT may be rewritten as 


(k— 1) summands 


——_—_) 
xT = (0)@ +++ ©(0)@R/(p*)@ --- © R/(p*). (8) 


Comparing (6) and (8), we obtain a contradiction, because, as remarked 
earlier, any two such decompositions of a module over a PID must have 
an equal number of nonzero summands. Hence, a = 8. This implies 
Ra,_, = Rb,_,. It then follows that Ra, = Rb, for all i= 1,...,.u. O 


3 Application to finitely generated abelian groups 

Because the ring of integers Z is a PID and any abelian group is a Z- 

module, an immediate application of Theorems 1.1 and 2.3 gives an 

alternative proof of the decomposition theorem for a finitely generated 

abelian group (Theorem 2.1, Chapter 8). 

3.1 Theorem. Let A be a finitely generated abelian group. Then 
A=Z'@ Z/a,Z@ -*:@ Z/a,Z, (1) 

where s is a nonnegative integer and a, are nonzero nonunits in Z, such that 
a,|a,| -** |a,. (2) 

Further, the decomposition (1) of A subject to the condition (2) is unique. 

(Z° is interpreted as the trivial group (0).) 

In particular, if A is generated by (x, ,...,x,) subject to 


n 
> ayx,= 0, 1 sis m, then 
= } 


(n—r) copies 
a 
Am=ZX° XZXZ/a,ZX °° XZ/a,Z, 


where Q,,...,@, are the invariant factors of the m X n matrix A = (a,,)). 


3.2 Examples 


(a) The abelian group generated by x, and x, subject to 2x, = 0, 3x, = Ois 
isomorphic to Z/(6), because the matrix (of constraints) is (3 $), which is 
equivalent to (8 9). 


Rational canonical form 409 
(b) Find the abelian group generated by {x, ,x.,x;} subject to 


5x, + 9x, + 5x, = 0, 
2x, + 4x, + 2x, =0, 
X,; + x2, — 3x; = 0. 


5 9 5 
A=|!2 4 2 
1 1 -3 
Perform C, ~~ C,, R, = 2R), C; iii C,, C; > CR; ae R,, (= 1)R;, 
(— 1)C,, C, <* C,, and C, — C, in succession to obtain that A is equiva- 
lent to 


Hence, the desired abelian group is isomorphic to Z/1Z X Z/2Z X 
Z/4Z = Z/(2) XK Z/(4). 


Problem 


Compute the invariants and write down the structures of the abelian 
groups with generators x, ,...,x, subject to the following relations: 


(a) n=2;x,+x,=0. 

(b) n= 3; 3x, — 2x, =0, x, + x, = 0, —x, + 3x, + 2x; = 0. 

(c) n=3; 2x,—x,=0, —3x, + 8x, + 3x,=0, and 2x, — 
4x, —x,=0. 


4 Rational canonical form 


The fundamental theorem of finitely generated modules over a PID has 
interesting applications in obtaining important canonical forms for 
square matrices over a field. Given an 1 X n matrix A over a field F, our 
effort will be to choose an invertible matrix P such that P~‘AP is in a 
desired canonical form. This problem is equivalent to finding a suitable 
basis of a vector space F" such that the matrix of the linear mapping 
x» Ax with respect to the new basis is in the required canonical form 
(Corollary 6.4, Chapter 14). 

Not every matrix is similar to a diagonal matrix. For example, let 
A = [) 2], a # 0, be a matrix over R. Suppose there exists an invertible 
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matrix P such that 


P-'AP = 14 a 
Clearly, 
det(P-'!AP — xI) = det(A—x/). 
Thus, 
es 0 I-x a 
QO d,-x 0 I-x! 


This implies d,d, = 1 and d,+d, = 2,sod, = 1 ™ d,. Hence, P-'AP = J; 
that is, A = /, a contradiction. 

However, simple canonical forms are highly important in linear alge- 
bra because they provide an adequate foundation for the study of deeper 
properties of matrices. As an application of the fundamental theorem for 
finitely generated modules over a PID, we show that every matrix A over a 
field is similar to a matrix 


B, 
B, 
B, 

where B, are matrices of the form 

0 0 - QO 

] 0 eee 0 

Oo 1] «-- Oo * 

0 O -- 4 * 


Let V be a vector space over a field F, and let 7; V-— V be a linear 
mapping. We can make V an F[x]-module by defining the action of any 
polynomial f(x) = ay + a,x + °°: +4a,,x™ on any vector vp € Vas 


S(x)v = agu + a,(Tv) + ++ +4, (Tv), 


where 7‘v stands for T {v). Clearly, this action of F [x] on Vis the exten- 
sion of the action of F such that xv = 7». First we note the following 
simple fact- 
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If V is a finite-dimensional vector space over F, then V is a finitely 
generated F(x]-module. 

For if (v,,...,0,,) is a basis of V over F, then (v, ,...,v,,) is a set of genera- 
tors of V as F[x]-module. 

Although it follows by Theorem 1.1 that the finitely generated F[x]- 
module V can be decomposed as a finite direct sum of cyclic modules, we 
provide a proof that gives an explicit algorithm for obtaining the decom- 
position in this particular situation. 


4.1 Theorem. Let T€ Hom,(V,V), and let f,(x),..., f,(x) be the invar- 
iant factors of A — xI, where A is a matrix of T. Then 


pipe iis Bree ope 
(fi(x)) (f,00)" 


Proof. Let (v,,...,0,) be a basis of V, and let A = (a,,) be the matrix of T 
relative to this basis. Then 


n 
Ty = Say, isjsn. 
im] 


ith 
e, = (0,...,0,1,0,...,0), i= 1,2,...,m, 


be the standard basis of the F[x]-module F[x]” = F[x] x --- x F[x] 
(n times). As we remarked earlier, v,,...,v, 18 a set of generators of the 
F[x]-module V. Let 


@: F[x}"— V 
be an F[x]-homomorphism sending e, > v,. We claim 
Ker @ = (A — x/)F[x}". 
For let 
&,(x) 
E F[x}", 


8n(x) 
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Hence, Ker @ = (A — x/)F[x]", as claimed. Therefore, 
~ — For 
(A — xI)F[x]"" 


Let Pand Q be n X n invertible matrices over F [x]. Note that for any 
n Xn invertible matrix 7 over F [x], TF [x]" = F[x]”. So 


V 


F[x]}" = F[x}" 
P(A —xI)Q F[x}"  P(A— x1) F[x]" 
P F[x]" F[x]" 


P(A — xl) Fix]" (A—x1/) F[x]”” 


by a canonical isomorphism sending 


81(x) 8,(x) 
P | + P(A — x1) F[x]"—> : + (A — xI)F [x]’. 
£402) #42) 
Therefore, 
y F[x}" Fix} 


~(A—xIFEXY P(A= x1) Fly” 

when Pand Qare any n X ninvertible matrices. Choose Pand Qsuch that 
P(A — xI)Q = diag(f;(x),...,f,0%), 

where f;(x)|2(x)] °°* U(x). Then 


Je F[x}" ___ FIX} @ +: © F[x] 
P(A—xIQ F(x! fQX)FIX]O °° OF()FI[x] 


Fld 9... @ FLL 
(f(x) (f(x) 


Note. By abuse of terminology /,(x),..., f,(x) are also called invariant fac- 
tors of A or of 7. The sequence of factors f(x), fi+1(),....f,(x) obtained 
from the invariant factors of A — x/ by discarding units, if any, are known 
as the invariant factors of F [x]-module V. 


The proof of Theorem 4.1! shows that det(A — x/) is the product of the 
invariant factors of the matrix A — xl. 


414 Finitely generated modules over a PID 


Definition. The polynomial f(x) = det(A — x1) is called the characteristic 
polynomial of A, and its roots are called characteristic roots (or eigen- 
values) of A. 


Some authors define the characteristic polynomial of A to be /(x) = 
det(x/ — A), which differs from our definition by only the factor (— 1)’, 
where n is the order of A. 


Definition. Let T& Hom,(V,V). If V as an F[x)-module relative to T is 
cyclic, then V is called a T-cyclic space. In this case T is also called acyclic 
endomorphism of V. 


4.2 Theorem. Let W be a subspace of V. Let T€ Hom,(V,V) such 
that TW C W. Then W is a T-cyclic subspace iff there exists an element 
w & W such that (w,Tw....,T*—'w) is a basis of W for some k= 1. 


Proof. Let dim V = n, and let W be a T-cyclic subspace generated by w. 
Since dim V = n, the vectors w,7w, T2w,...,7."w must be linearly depen- 
dent over F. Let 7’w be the first of these vectors linearly dependent on the 
preceding ones. Then w,7w,...,7’~'w are linearly independent, while 


T'w=awt - +a,(Tw)', a, EF. (1) 
We claim that every T‘w is linearly dependent on w,7w,...,7’—'w. For 


i= r— | this is clear, and for i = rit follows by (1). Thus, let us take i > r 
and use induction on i. Applying 7‘~’ to (1), we obtain 


T'w=aoT''w) + ++: +a,_,(T'w), 
and by induction hypothesis each term on the right side is linearly depen- 


dent on w,7w,...,7’—'w. Hence, T‘w is also. Because W = F[x]w, every 
element of W is of the form 


(bp + b,x + — + b,x™)w = (Dol + b,T+ “Tr + b,,T™)w, 


where 56,E F. It follows then from these considerations that 
({w, Tw,...,7°—'w) is a basis of W. 

Conversely, let {w, 7w,...,7 'w} be a basis of W. 

This implies that for any element cow + c,(Tw) + °°* +0,,(7"~"'w) 
of W, there exists a polynomial f(x) = cy t,x + *** +0,-,x'~' such 
that 


Cow + c,(Tw) + +** +0, (Tw) = (Cot e,xt 00° +0,_,x"—')w. 
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Hence, WC F[x]w. Because F[x]w C W,, it follows that W = (w); that is, 
W is a T-cyclic subspace. O 


Definition. Let f(x) = ag + a,x+ °°: +a,_,x*~'! + x* bea monic poly- 
nomial over F. The companion matrix of f(x) is the k X k matrix 


0 0 0 -a, 
] 0 0 —~@, 
0 ] 0 —@A, 
00 1 ~a., 


43 Theorem. Let T€ Hom,(V,V), and let W be a T-cyclic subspace 
of V with a basis (w,Tw,...,T*—'w). Then the matrix of T on Wis thek X k 
companion matrix of a monic polynomial f(x), where ({(x)) = ann w, the 
annihilator of w in the ring F [x] (also called the order ideal of w). 


Proof. Because {w,Tw,...,7 *—'w) is a basis of W, 
T*w = ayw + a,(Tw) + -** + a,_,(T*~'w), 
where a; are unique elements of F. Thus, 
(dy ta,x + °*° +a, ,x*~! — x*)\w = 0. 


If ann w = (/(x)), then 

Ay + a,x + -7+ +a,y_,x*"! — x* E (f(x), 
so k > degree f(x) =s, say. So if f(x) = by + b,x +--+ b,_,x7 1+ x°, 
then 

O= (bo + bx + --- + b- x81) + x*)w 

= bow + b, T(w) + --» + b,- 1, T*” '(w) + Tew. 

Hence, T*w is a linear combination of w, Tw,..., T°” 'w, whence s > k. This 
gives s=k. Therefore, if g(x) =a) +a,x +--- +a,_,x*"'—x*, then 
g(x) = cf(x), where ceF. By comparing the coefficients of x* on each 
side, we get c= —1. Hence, f(x) = —g(x). Because 


T*w = agw + a,(Tw) + --- + a,_,(T*” 'w), 


416 Finitely generated modules over a PID 


we obtain that the matrix of Ton W is 


0 0 coe 0 Ay 
I 0 eS 0 a; 
Ol 0 a 


0 0 ee | ay 
which is the companion matrix of f(x). This completes the proof. O 


Remark. In Theorem 4.3, f(x) = x* — a, . ,x*~'— ++» — a,x — dg is the 
minimal polynomial of T on W. 


Let V be an n-dimensional vector space over fF. Then dim 
Hom,;(V.V), = n’, so if 0 # TE Hom,(V,V), the collection (7'), i= 
0, 1,2,....27, must be linearly dependent over F. Let 7” be the first in this 
collection linearly dependent on the preceding ones. Thus, 


T’ = aol +a,T + +°+ +a,_,T™' 


for some a; € F. Hence, Vas an F[x]-module relative to 7 has the prop- 
erty that 


0=(api+a,T+ °*- +a,_,T"'—T )v 
= (dy tayxt os +a, x’! —x)v. 


Because dy) + a,x + -*> +.4,_,x'~'— x’ #0, every element of the F[x]- 
module V is a torsion element. Because V is a finitely generated F[x]- 
module, by the structure theorem of finitely generated modules over a 
principal ideal domain (Theorem 1.1), we have 


V=V,©- OV, : 


where V, [= F[x]/(f(x))] are cyclic F{[x]-modules, and 
ACMA) +++ 1-00; (F(x) is the annihilator ideal (order ideal) of V, in 
F [x]. 

Because V, isa 7-cyclic space, the linear mapping 7 on V, is represented 
by the companion matrix B, of f(x) with respect to a suitable basis 8, of V, 
as shown in Theorem 4.3. So the linear transformation T on V is repre- 
sented by the matrix A, which ts the direct sum of the matrices B,,...,B, 
with respect to the basis 8 = Uf_, @,. Thus, we have 
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4.4 Theorem. Let T€ Hom,(V,V). Then there exists a basis of V 
with respect to which the matrix of T is 


B, 


B, 


B, 


where B, is the companion matrix of a certain unique polynomial f(x), 
i= 1,....r, such that f(x) A): (x). Oo 


This form of A is called the rational canonical form of the matrix of T. 
The uniqueness of the decomposition of V subject to f,(x)| A(x) °°: f(x) 
shows that 7 has a unique representation by a matrix in a rational canoni- 


cal form. 
The polynomials f,(x),...,/;(x) are the invariant factors of A —x/, but 


they are also called the invariant factors of A (see note following Theorem 
4.1). 


4.5 Examples 
(a) Reduce the following matrix A to rational canonical form: 


-3 2 #0 
A= 1 0 I 
L 3-2 


Solution. The invariant factors of A — x/ are obtained in Problem | (b) 
in Section 3 of Chapter 20. These are 1,1,(x + 1¥(x + 3) =3+7x+ 
5x? +x. Therefore, the rational canonical form is 


00 -3 
1 0 -7 
OQ: 1 *=5 


(b) The rational canonical form of the 6 X 6 matrix A whose invariant 
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factors are (x — 3),(x — 3Xx — 1),(x — 3Xx — 1) is 


Problem 
1. Find rational canonical forms of the following matrices over Q: 


15 7 2 : 
(a)}0 4 31. (b] 1 ; 
001 3 ; 


11 
0 01 
1}. O19 g 
3 0 0 


oh & 


0 


Some authors further decompose each B, as the direct sum of matrices, 
each of which is a companion matrix of a polynomial that is a power of an 
irreducible polynomial, and call the resulting form rational canonical 
form. The powers of the irreducible polynomials corresponding to each 
block in the form thus obtained are then called the elementary divisors 
of A. 


5 Generalized Jordan form over any field 


We now give another application of the fundamental theorem of finitely 
generated modules over a PID. We shall prove that every matrix over C is 
similar to an “almost diagonal” matrix of the form 


J, 
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where 
A; O 0 
1 A, 0 
J.=10 1 
00. A, 


But first we obtain a canonical form over any field, of which the Jordan 
canonical form is a special case. 

Let V be a vector space of dimension n over F. Let T€ Hom,(V,V). 
Because Hom,( V,V) is an n?-dimensional vector space over F, the list 


Leal eee 


is linearly dependent, and, hence, there exist a,@,,...,@,: © F (not all 
zero) such that 


OQ +0,T H+ + +OpT™ = 0. 


Therefore, 7 satisfies a nonzero polynomial over F. We begin with 


5.1 Theorem. Let V be a finite-dimensional vector space over a field 
F, and let T€ Hom,(V,V). Suppose f(x) = g(x)h(x) is a factorization of 
J (x) in F[x] such that (g(x), h(x)) = 1. Then f(T) = 0 ifand only if V = 
Ker g(T) © Ker A(T). 


Proof. Suppose /(T)=0. Because (g(x),h(x))=1, there exist 
a(x),b(x) € F[x] such that a(x)g(x) + b(x)h(x) = 1. Hence, 

a(T)g(T) + b(T)A(T) = 1. (1) 
Then for each ve V, 

aT )g(T)v + b(T)A(T)v = v. 


Because 2(T)h(T) = 0, a(T)2(T)veKer h(T), and b(T)h(T)veKer g(T). 
Thus, 


V = Ker g(7) + Ker A(T). 


Further, if w €& Ker 2(7) 2 Ker A(T), then g(7)w = 0 = h(T)w. Thus, 
by (1) w=0. Hence, V = Ker g(7) © Ker h(T). Conversely, let V = 
Ker g(7T) © Ker A(T), and let vE V. Then v= v, + v5, v, € Ker 9(T7), 
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v, © Ker h(T). This gives g(T)h(T\v, + v,) = 0, because g(7) and h(T) 
commute. Therefore, /(7)v = 0 for all ue V;so f(T) =0. O 


More generally, we can similarly prove 


§.2 Theorem. Let V be a finite-dimensional vector space over a field 
F, and let T& Hom,;( VV). Suppose f(x) = p4(x) ... pfXx) is a factoriza- 
tion of f(x) into irreducible polynomials p(x) over F. Then f(T) = 0 ifand 
only if 


V = Ker p¢(T) © -°+ © Ker pf*(7). 


In the notation of Theorem 4.2, let usset W, = Ker p#(T), i = 1,...,k. It 
is clear that TW, C W,. Let 7; be the restriction of 7 on W,, so 7,€ 
Hom -(W,,W,). Let us make W, an F[x]-module relative to 7; in the usual 
way. Then W, is a finitely generated F[x]-module, and, hence, W, is the 
finite direct sum of cyclic modules (w,,), 1 sj = s. Because every element 
of W, is annihilated by pf(7), p#(7)w,; = 0. Our object is to find a basis of 
each cyclic module (w,,) so that the matrix of 7; relative to the union of 
these bases is in generalized Jordan canonical form. 

For clarity we drop all subscripts and proceed to find a basis of a 
T-cyclic subspace (v), where p*(x)v = 0, p(x) is an irreducible polynomial 
over F, and s is the smallest positive integer such that p*(x)v = 0; that 1s, 
p*(x) is the minimal polynomial of 7 [or, equivalently, ann v is the ideal 
(p*(x))]. 


5.3 Lemma. Let V = (v) be a T-cyclic space, and let the annihilator of 
v (i.e., the order ideal of v) be (p*), where p = p(x) = x' +a,_,x' + 
*** + a, is an irreducible polynomial over F. Then 


B= (v,T0,..., 7 'v) U ( pv,pTo,....pT* 'v) 
U ees U (p*—'v,p2-'To,...,p2—'!T* 'v) 


is a basis of V, and the matrix of T relative to this basis is 


P 0 0 
N P 0 
0 N 


© 
me) 
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where P is the t X t companion matrix of p and 


00 | 
0 0 eos 0 
velo: 
0 0 eee 0 


isat Xt matrix all of whose entries, except at the (t,1) position, are zero. 


Proof. Because V =(v) is a 7-cyclic space, V has a basis of the form 
{v,Tv,...,7 *~'v} for some positive integer kK (Theorem 4.2). Further, 
p*(x)v = Oimplies p°(7) = 0. Therefore, 7 satisfies a polynomial of degree 
n=st. Thus, 7™v is a linear combination of v,7v,...,.7°~'v. But then 
n= k. If possible let k < n. Then, because (v,7»,..., 7 *~'v} is a basis of V, 
we get that 7“v is a linear combination of v,7»,...,7*~ 'v, so there exists a 
polynomial g(x) of degree k <n such that g(x)v = 0, a contradiction. 
Hence, n = k = dim V. 

The set B of n = st elements in the theorem must be linearly indepen- 
dent. Otherwise there will be a nonzero polynomial /(x) over F of de- 
gree < nsuch that A(x)v = 0, a contradiction. Hence, the set B is linearly 
independent and, therefore, a basis of V. 

We now proceed to find the matrix of 7 relative to the basis B. But first 


we express each of 
T'v, pT'v,..., p?~'T'v 

as a linear combination of the basis elements. Now 
O=p—(x'+a,_,x'' + +++ +p), 


sO 
0 = pu—(T'v+a,_,T''vt+ +++ + aov), 
which gives 
T'v = —agv—a,Tv— ++ —a,_,T'"'v + pv. (1) 
Further, 
O = p? — p(x! + a,_, x"! + +> + ap) 
gives 


QO = p*v — pT'v — a,_, pT 'v— +++ — agpv. 
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Thus, 

pI'v =—aypv—a,pTv— ++ —a,_,pT*'v + pv. (2) 
Similarly, by considering 

0 = p?—! — p?-Ax!' + a,_, x! + ++ +a), 
we get 
p?-?T'v = — app’ 70 — a, p??Tv — °* — a,-,p*-?T'v + p*'v. (3) 
Finally, to obtain the expression for p*~'7T'v, we write 

0 = p'v= p*"(x' +a, , xo ' +o + ap)v, 


p?'T'v = —ayp*'v — a, p*'Tv— id —a,-,p°"'T'v. (4) 
Consider now 


Tv = To, 
T Tv = T22, 


T T'-2y = T'p, 
T T''v= T'v = —aguv —a,Tvu— -*+ —a,_,T* 'v + pv 


[from (1)), 
Tpv = pT, 
TpTv = pT’», 
TpT 70 = pT'~ 'v, 
TpT'-'v = pT'v = —agpv — a, pTv— -** — a,_, pT 'v + p*v 
[from (2)], 


Tp*-'v = p*—'To, 
Tp*~ ! Tv= p*~ 1 Tv, 


Tp?) T*-2y = p*-' Ty, 
Tp Tv = pT 'v = —ayp?'v— a, p?'To— coe 
—a,-,p?'T'v [from (4)]. 
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Thus, the matrix of T relative to B is of the form 


PO -- 0 
N PP... Q 
0 N : 
00 -- P 


where P is the ¢ X ft companion matrix 


0 0 ++ 0 ay 


1 O -+ 0 —a, 
O 1 «+ 0 a, 
00 - 1 —a,, 


of the polynomial p(x), and N is the ¢ X ¢ matrix all of whose entries are 


zero, except at the (1, ¢) position, where the entry is 1. This completes the 
proof of the lemma. O 


Let us call a matrix of the form 


P 0 0 
N P 0 
0 N 
0 0 - P 


where P is the companion matrix of a certain irreducible polynomial over 
F of degree t, and N is a matrix all of whose entries are zero except at the 
(1, 0) position, where the entry is 1, a generalized Jordan block. It is note- 
worthy that if F is the field of complex numbers, then Pisa 1 < 1! matrix, 


(A), and Nisa 1 X | matrix (1), so the generalized Jordan block becomes 
the usual Jordan block. 


5.4 Theorem. Let T& Hom;(V,V). Then there exists a basis of V 


such that the matrix of T relative to this basis is a direct sum of generalized 
Jordan blocks. 
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Proof. Follows from Lemma 5.3 and the discussion preceding the 
lemma. O 


Remark. The generalized Jordan representation (and, in particular, the 
usual canonical representation in Jordan form) of a linear mapping T is 
unique up to the ordering of the blocks. This is a consequence of the 
uniqueness of the decomposition of the F[x]-module V into the direct 
sum of cyclic modules, each of which is annihilated by a power of an 
irreducible polynomial over F. 

The polynomials p*(x), where p(x) is irreducible over F, corresponding 
to each generalized Jordan block are the elementary divisors of A. 

The matnix version of Theorem 5.4 is 


5.5 Theorem. Let A be ann X n matrix. Then there exists an inverti- 
ble matrix P such that P-' AP is a direct sum of generalized Jordan blocks 
J;, where J, are unique except for ordering. 


Note that the Jordan canonical form is determined by the set of ele- 
mentary divisors. 


5.6 Examples 


(a) For a 3 X 3 matnx with invariant factors (x — 2) and (x? — 4), the 
elementary divisors are (x — 2),(x — 2), and (x + 2), so the Jordan canon- 
ical form is 


(b) For a6 X 6 matrix with invariant factors (x + 2)?, (x + 3)?(x + 2)? 
the elementary divisors are (x + 2)?,(x + 2)?, and (x + 3)?, so the Jordan 
canonical form is 
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Problems 


1. Find invariant factors, elementary divisors, and the Jordan ca- 
nonical form of the matrices 


es 
0 4 2 Ct aa 
(a) {—-3 8 3|. () 
Re ete ee 00 53 
00 O04 


2. Find all possible Jordan canonical forms ofa matrix with charac- 
teristic polynomial p(x) over C in each of the following cases: 


(a) p(x) =(x— 1)°(x + 1). 
(b) p(x) = (x — 2)7(x + 5). 
(c) p(x) =(x+ 1Xx + 2)2(x? + 1). 


CHAPTER 22 


Tensor products 


1 Categories and functors 


Let C be a class of objects A, B,C,..., together with a family of disjoint 
sets Hom(A, B), one for each ordered pair A, B of objects. The elements 
of Hom(A, B) are called morphisms. If « € Hom(A, B), write «: A > B and 
call A the domain of « and B the codomain of a. Suppose for each 
aeHom(A, B) and BeHom(B,C) there is a unique morphism Bae 
Hom(A, C). 


Definition. We call C a category if the following axioms hold: 
(i) Associativity. Vae Hom(A, B), VBeHom(B, C), and Vye Hom(C, D), 
(Ba) = (yB)a. 
(ii) Existence of identity morphisms. VW AEC, 11,€Hom(A, A) such 
that I, f =f and gl, =g whenever I, f and gl, are defined. 


It is clear that VAeC, J, is unique. 


1.1 Examples 


(a) Let C be the family of abelian groups and let the homomorphisms 
between them be morphisms. Then C is a category, denoted by Ab or 
mod-Z. 

(b) Let C be the family of right (or left) R-modules and let the 
R-homomorphisms between them be morphisms. Then C is a category 
denoted by mod-R (or R-mod). 

(c) The family of groups also forms a category with homomorphisms 


as morphisms. 
426 
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(d) With objects as sets and morphisms as maps between them, we 
obtain a category called the category of sets. 


We now proceed to define a functor. Roughly speaking a functor is a 
map between two categories. 


Definition. T is called a covariant (contravariant) functor from a category 
C, to a category C, if for each object AEC, there is a unique object 
T(A)eEC,, and for each morphism ae Hom(A, B) there is a unique morphism 
T(a)e Hom(T(A), T(B)) [Hom(T(B), T(A))] such that the following 
axioms hold: 


(i) If 1:A—A isan identity morphism in C,, then T(I): T(A) ~ T(A) 
is an identity morphism in C,. 

(ii) If a, B are morphisms and Ba is defined then T(Ba) = T(B)T(a) 
(T (Ba) = T(a)T(B)]. 


Note that if Ba is defined then 7(8)T(a) [T(a)T(f)] is also defined 
whenever T is a covariant (contravariant) functor. 


1.2 Examples 


(a) Let C be a category with objects as groups and morphisms as 
homomorphisms. Then T is a functor from C to C if T(G)=G’, the 
derived group of G, for if a: A —- Bis a homomorphism then a| ,. = T(a) 
maps T(A) = A’ to T(B) = B’ and T(/) = I, where J is the identity map. 
Furthermore, if B is another homomorphism then T(«f) = T(a)T(B) 
(follows by definition of T(a)). Thus T is a covariant functor. 

(b) Let C be the category with objects as free groups and morphisms 
as homomorphisms. Let S be the category with objects as sets and 
morphisms as functions. Let A be a set in S and F(A) denote the free 
group generated by A. For each function a: A—B, where A,B are 
objects (= sets) in S, let F(a): F(A) > F(B) denote the homomorphism 
of the free group F(A) to F(B) obtained by extending the map a in a 
natural way. Then F its a covariant functor from S to C. 

(c) Let C be the category as in (a) with objects as groups. and 
morphisms as homomorphisms, and let Ab be the category of abelian 
groups. Let T: C-— Ab be the functor defined by T(G) = G/G’. Then 
T is a covariant functor. 

(d) Let C be the category mod-F where F is a field, thaf is, the category 
of vector spaces over F. For a vector space V, let V* = Hom(V, F), 
the dual space of V. Then T:C +C defined by T(V) = V* is a contra- 
variant functor. We may note that if a: V— V is a morphism (=linear 
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mapping) in C then T(a): V* + V* is defined by T(a)(@) = da. Thus 
for all linear mapings «a, 8B, T(«f) = T(B)T(a). 


Remark. Let R bearing and mod-R be the category of right R-modules. 
Let Hom,(—, A) [Hom ,(A, —)] denote the functor from mod-R to 
mod-Z that sends X to Hom,(X, A) [Hom (A, X)]. Then Homa(—, A) 
is a contravariant functor and Hom,(A, —) is a covariant functor. 


In the next section we define the tensor product M @,N of a right 
module M, with a left module gN, which is an abelian group in general. 
It can be shown that both 7, =(:)@rN and T,=M@®,(-) are 
covariant functors from mod-R to mod-Z. 


2 Tensor products 


In this section and the others which follow, we will briefly describe the 
concept of the tensor product of modules. Tensor products occur often 
in all branches of mathematics. The use of tensor product is indeed a 
fundamental tool for scientists. 


Let M,N, and P be abelian groups. A map f: M x N-P is called 
bi-additive if 

f(m, + mz, n) = f(m,,n) + f(m2,n) 
and 

f (m,n, + nz) = f (m,n) + f(m, nz) 


for all m,m,,m,€M and n,n,,n,€N. This implies for a fixed neN, the 
map m— {(m,n) from M to P and for a fixed meM the map n— f(m, n) 
from N to P are both homomorphisms. It follows that f(m, 0) = 0 = f (0, n) 
and f(m, —n) = —f (m,n) =f(—m,n) for all meM and neN. 

Suppose now M is a right R-module and N is a left R-module. Then 
the map f: M x N — P is called balanced if it is bi-additive and satisfies 


f (mr, n) = f (m, rn) 
for all meM, néeN, and reR. 


Throughout this chapter all rings have unity unless otherwise stated. 


Definition. A tensor product of a right R-module M and a left R-module 
N is a pair (T, h), where T is an abelian group and @ is a balanced map 
from M x N to T satisfying the following condition: Given any abelian 
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group P and a balanced map f: Mx N-P, there exists a unique 
homomorphism f: T—» P such that the diagram 


MxwN 
jo J 
T——* P 


I 
is commutative, that is, fd =f. 


Remark. It is easy to check that if (T’, ¢’) is another tensor product of M 
and N, then there exists an isomorphism «: T— T’ such that ¢' = a@. 
Thus the tensor product is unique, up to isomorphism. 


2.1 Existence and construction of tensor product 


We will now study a construction which yields an abelian group when 
we are given both a right R-module M and a left R-module N. In this 
construction, which is one of the most important in abstract algebra, 
the resulting group is indeed the tensor product of the given modules 


M and N. 
Let Z(M, N) denote the free abelian group with the set M x N asa 


basis. Let Y(M, N) denote the subgroup of Z(M, N) generated by the 
elements of the form 

(m, + m2, n) _ (m,, n) _ (m2, n), 

(m,n, + nz) —(m,n,)—(m, n2), 

(mr,n) — (m, rn), 
for all m,m,,m.,€M,n,n,,n,EN, and reR. Let T(M, N)=2Z(M, N)/Y(M,N) 
be the quotient group. We proceed to show that T(M, N) is the tensor 
product of M and N. Let P be any abelian group and f: M x N—P be 
a balanced map. We need to produce a balanced map ¢: M x N- 
T(M,N) and a unique group homomorphism f: T(M,N)— P such 
that the diagram 

MxN_ 

[es ee, 
T(M,N)—» P 
JS 


rae 
~ 
. 


commutes. We choose ¢ to be the restriction of the canonical homo- 
morphism 


n: Z(M, N) + 2%) 


= T(M,N) 
Y(M,N) 
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to M x N. Further, the mapf: M x N — P can be naturally extended to 
the free abelian group Z(M,N) with basis M x N, yielding a homo- 
morphism 


f:Z(M,N)—>P. 


But since f is a balanced map, f vanishes on the subgroup Y(M, N) and 
hence induces 


f:Z(M,N)/Y(M, N)— P 


that is, from T(M, N) to P. It is clear that fd = f. We note f is unique. 
Else, let g: T(M,N)— P be a homomorphism such that g¢ = f. Then 
gh(m, n) = f (m,n) for all meM, neN. But since {¢(m,n)|meM,neN} 
generates T(M, N), it follows g= f. This proves that T(M, N)=Z(M, N) 
/Y(M,N) ts the tensor product of M and N. 

We shall write M ®,N for T(M, N) and m@n for (m,n) + Y(M, N)= 
(m,n), me M, ne N. Since every element of Y(M, N) is mapped into zero 
under the canonical mapping @¢, we have 


2.2 Theorem. Let M and N be right and left R-modules, respectively. 
Then 


(i) (m, +m2)@n=m,On+m,@n, 
(ii) m@(n, +n,)=mMOn, +m@ny,, 
(iii) mr@n=m@Orn, 
for all m,m,,m,EM, n,n,,n,EN, and reR. 


Next we prove 


2.3 Theorem. Let M and N be right and left R-modules, respectively. 


(a) Let AeZ,meM, and neN. Then Am@n = A(m@n)=m@aAn. In 
particular, O@n=0=m@®0 and —m@n= —(m@n)=m@(—n). 

(b) Every element of M@xN is of the form ¥;,,,.®n,, meM, 
n,EN. 


Proof. (a) If nis kept fixed, the mapping given by m+ m@ nis a group 
homomorphism of M into M@,N. Thus for AeZ, 


AM®n = A(M@ n). 
Similarly 
m@ An = A(m@ n). 
(b) Since every element of Z(M, N) is of the form 
Y Am, n,), EZ, meM, nEN, 


finite 


Module structure of tensor product 431 


it follows that every element of M @,N isa finite sum ) ,(m,@ n,). But 
A(m;@ n,) = Aim, @ n,, 


and so every element of M ®,N is a finite sum of elements of the form 
m@Qn,meM,neN. (1 


Problems 
1. Prove the following: 
(1) Q@zZ, = 0. 


(ii) Z6@z Z,=0. 

(ii) Q®zZ = Q), as additive groups. 

(iv) Z,,®zZ, = Z4, as additive groups, where m, n are positive 
integers and (m,n) = d. 

(v) Z, @Q22Z = Z>. 


2. Let A be a torsion abelian group. That is, for each aeA there 
exists a positive integer n (depending on a) such that na =0. 
Show that 


A®zQ=0. 


3 Module structure of tensor product 


If M and N are right and left R-modules, respectively, over an arbitrary 
ring R, then M @,N is an abelian group. However, if R is commutative, 
then M@®,N can be given the structure of an R-module. A natural 
definition for making M@®,N a right R-module is (nm®n)r = mr ®n, 
me M,neéN, and reR. But it is quite nontrivial (though straightforward) 
to verify that this is indeed well defined. We proceed to show this, 
assuming R is a commutative ring. 
Let reR. Consider the homomorphism 


f:Z(M,N) +M@,N 


of free abelian group Z(M, N) to M@,N given by f(m, n) = mr@n. We 
note that the elements of the form 


(m, + m,n) —(m,,n) —(m2,n) (1) 
and 


(m,n, + nz) —(m,n,) —(m, n2) (2) 
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map, respectively, to 
(m,r+mzr)\@n—m,r@n—myr@n 
=m,r@Qn+m,r@n—m,r@Qn—m,r@n=0 
and 
mr @(n, +n,)— mr@n, —mr@n, =0, 
for all m,m,,m,éM and n, n,,n,EN. Furthermore, for all me M, neN, 
and seR, the elements of the form 
(ms, n) — (m, sn) (3) 
map to 
(msr)® n — mr ® sn = (msr)® n — (mrs) @n = 0, 
since R is commutative. 
Thus f vanishes on the subgroup Y(M, N) of Z(M, N) generated by 
elements of the form (1), (2), and (3). So f induces a homomorphism 
f: Z(M,N)/Y(M,N) =M®,;N>M®,N, 


where f((m,n)+ Y(M,N))=mr@n. That is, f(m@n)=mr@n. This 
proves that given m@neM@,N and reR, the element mr@n is 
uniquely defined. Thus (m @ n)r = mr @ n is well defined, and it is easy 
to verify that M@,N becomes a right R-module. Incidentally, M@,pN 
is a left R-module in a natural way. (Over a commutative ring R a 
right R-module can be naturally looked upon as a left R-module.) 


3.1 Theorem. Let M be a right R-module. Then M@zR=M as 
additive abelian groups. Further, if R is commutative, then M@,;R=M 
as R-modules. 


Proof. Consider a group homomorphism 
f. Z(M,R)—>M 
given by f(m,r)=mr. Clearly Y(M,R)c Kerf and so f induces 
homomorphism f from Z(M, R)/Y(M, R) = M@gR to M given by 
f(m@r)=mr. 


Also the map g:M > M @ xR given by g(m) = m@ | is a group homo- 
morphism whose inverse is f. Thus f is an isomorphism as desired. 
In case R is commutative, then by definition of scalar multiplication 
of elements of R with those of M@,zN 


Ff ((m@r)s) = f (ms@r) =(ms)r = m(sr) = m(rs) =(mr)s = f(m@r)s. 
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Thus f is an R-homomorphism and is indeed an R-isomorphism as 
shown in the first case. 0 


3.2 Theorem. Let M, N, and P be R-modules over a commutative ring 
R. Then 


(i) M@gN=~N@eM as R-modules. 
(ii) (M@rpN)@rpP=M@a(N @eP) as R-modules. 


Proof. First one needs to recall that if M is a right R-module over a 
commutative ring R, then M is a left R-module by defining rm = mr, 
reR, me M. Consider the homomorphism f: Z(M, N) + N @ RM given 
by f(m,n) =n@m. Since Y(M, N) c Kerf, f induces the homomorphism 
f:M@pN-7N@pM Biven by f(m@n)=n@m. One can, similarly, 
obtain a homomorphism from N@,pM to M@ RN which is the inverse 
of f, proving that M@gN~N@xgM. The proof of part (ii) is 
similar. © 


Problems 


1. Let M bea right R-module and e be an idempotent in R. Show 


M @pRe = Me (as groups). 
2. Give an example of an R-module M and a left ideal J in R 


such that M@,p1 # MI. 


4 Tensor product of homomorphisms 
Let M,M’ be right R-modules and N,N’ be left R-modules. Let 
f: M—M* and g: N-N’ be R-homomorphisms. Then the mapping 
Z(M,N)— M'®@,N'’ 
in which 
(m,n) — f(m)® g(n) 


is a group homomorphism which vanishes on the subgroup 
Y(M,N). Thus the above mapping induces a homomorphism from 
Z(M,N)/Y(M,N)=M@RN to M'@,N’, where the element 
(m,n) + Y(M, N) =m@n maps to f(m)®g(n). We denote this homo- 
morphism as f ®g. Thus 


(f @ g)(m@n) = f(m) @ gin). 
The mapping f @ g is called the tensor product of f and g. When R is 
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commutative, f ® g, as defined above, becomes an R-homomorphism 
from the R-module M@,N to the R-module M’ @,N’. 


4.1 Theorem. Let M, M’', M” be right R-modules and N, N', N” be left 
R-modules. Suppose f: M — M', f': M'— M’, g: NN’, and g’: N'> N" 
are R-homomorphisms. Then 

(f'@sVf@s)=(M' Ogg’). 


Further, iff and g are isomorphisms of M onto M' and M' onto M", respect- 
tively, then f ®g is also an isomorphism. 


Proof. The proof follows from the definition of the tensor product of 
homomorphisms. ( 


4.2 Theorem. Let (M,),.; and (Nj)j., be families of right and left 
R-modules, respectively. Then 


(EM) @x( DEN, = DL (Mi@xN)) 


jel jeJ GpjpelxJd 
which maps 
(Ex)@(5») 
onto 
>, x1 ® yj. 
ij 


Proof. The map 


s:( BEM.) x (@xN,)- Dd (M:@rN)j) 


tel jeJ Gi.frelxJd 


given by 


A((g-5r))-Fue 


is balanced and gives rise to a homomorphism 


7:( BUM: )@x( DEN) OY (MON) 


iel jeJ G.ferxJs 
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such that 


A((g»)e(9))-280 


On the other hand, the inclusions 
A: Mi> DUM; and Hy Ny > DEN, 


give rise to homomorphisms 


8ij= 4@ ji M.@aNj-+( EDM, ) Oa ®EN,). 


tel jeJ 
There exists a unique homomorphism 


z ® E(M@aN) (LOM, Ox OY N,) 
(ijpel x J iel jeJ 

which extends all the g;;s. Clearly f and g are inverses of each other 

and hence they are isomorphisms. ( 


Problems 


1. Let F bea free left R-module with a basis (e pjey Say, and let M 
be a right R-module. Then an element of M@pF can be 
uniquely written in the form >° j.x,;@e, with x,eM and x,;=0 
for almost all jeJ. 

2. If F isa free left R-module and f: M — M' is a monomorphism of 
right R-modules, then the induced map f @1: M@pF ->M'@ pF 
is also a monomorphism. 

3. Leth: A Bbea homomorphism of rings and let F be a free left 
A-module with a basis (e,),,,. Then if one regards B as a right 
A-module by defining ba = bh(a), B® , F is a free left B-module 
with a basis (1 Be ics- 

4. Let R be a commutative ring, and let F’ and F be free R- 
modules with bases (e;),., and (e,),,,, respectively. Then F’@,F 
is a free R-module with (e€;@ ej) jerx aS a basis. 

5. Let R be a commutative ring with 1, and let F be a free R- 
module. Then any two bases of F have the same cardinality. 
(Note that Theorem 5.4, Chapter 14, is a special case of this 
problem.) 

6. Let K bea field, V and W vector spaces over K, V, a subspace 
of V,and W, asubspace of W. Show that (V, @x W)A(V@, W,)= 
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V,®@,W,, where the vector spaces involved are identified to 
subspaces of V@, W by means of obvious canonical maps. 

7. Let R be a commutative ring. Show(-)@,WN is a right exact 
covariant functor from mod-R to mod-R. Show T need not be 
left exact. [A covariant functor T: mod-R — mod-R is right (or 
left) exact if for each short exact sequence 0+ A> B—~C-0, 
T(A)— T(B)— T(C)-+0 (or 0 T(A)— T(B)— T(C)) is also exact.] 

8. Let M,M’e€ mod-R and N, N’ € R-mod. Suppose {: M — M' and 
g: N--N’ be onto R-homomorphisms. Show that Ker(f @g) 
is generated by elements of the type a@n and m@®b where 
a& Kerf, b€ Kerg, mE M, and nEN. 


5 Tensor product of algebras 
Let K be a commutative ring. If A is a K-algebra, then the mapping 
Ax A-—-A given by (a,b)— ab is balanced, giving rise to a K-homo- 
morphism 

m,A®@,yA-+A 
such that m,(a@ a’) = aa’. If Bis another K-algebra, we have a sequence 
of maps 


(A ®x B) x (A @xB) *+ (A@xB)@x(A@xB) — 


(A @A)@x(B@,B) —— > A@xB, 


where the map @ is obvious and t is the unique isomorphism of K- 
modules such that 7r((a@® b)@(a’' @ b’)) = (a®a’')@(b@ db’). The com- 
position of these three maps is a mapping of (A ®, B) x (A ®, B) into 
A ®, B. This defines a multiplication in the K-module A ®, B, namely, 


(¥a.@6,)( ¥a,@b,) = 2. 410,® bib, 
i ’ 


which is easily seen to be associative. Also 1 @ 1 acts as identity. Thus 
A®,B becomes a K-algebra and is called the tensor product of the 


K-algebras A and B. 


5.1 Theorem. [f A,B, and C are K-algebras, then as K-algebras, 


(i) K@yA=A, 
(iti) A@,B~= B®, A, and 
(iii) A@,x(B@xC)=(A@,x B)@,C. 
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Proof. The isomorphism between the K-algebras follows from the 
definition of the tensor product of algebras by considering canonical 


maps. O 


5.2 Example 


Let R be a commutative ring with unity. Let A, B are ideals in R. Then 
R/A @pg R/B = R/A + B as R-algebras. 


Solution. Define a mapa:R/A x R/B—R/A + B by a(x + A,y + B)= 
xy+A+B. « is balanced and induces the group homomorphism 
a:R/A@ pR/B—-R/A + B. The map B:R/A + B— R/A @p R/B given by 
B(x + A + B) =(x + A)@(1 + B) is well-defined, for if x =a+, aeA, 
beB, then (x + A)@(1 + B)=(6+ AD@(1 + B) =(1+ A)@(b+ B= 
(1 + A)@(0 + B)=0. Clearly, &, 8 are algebra homomorphisms with 
aB = identity on R/A + B and Ba = identity on R/A @g R/B. 


Problems 


1. Let A be an algebra over a commutative ring K. Show 
A®,M,(K)= M,(A) as K-algebras. 

2. Show M,(K)@,M,(K)= M,,,(K) as K-algebras. 

3. Show Q®zQ~Q as Z-algebras. 

4. Let K beacommutative integral domain with | and let F be the 
field of fractions. Show F ®, F = F as K-algebras. 

5. Let E be an extension of a field F. Show F(x] @,-E ~ E[x]. 


Solutions to odd-numbered problems 


Chapter 1, Section 1 


lL. (iv) xEAU(BNC)@xEAorxE BNC 
= xe€ Aor(xe€ Band xE C) 
= (x € Aor xe B) and (x€ Aorx€E C) 
= xEAUBandxE AUC 
= xE(AUB)N(AUC) 


A B 
3. |AUBJ=mt+n-—k () 3 


5. A = TVs with defective 
picture tube. 
B = Those with defective 
sound system. 
C = Those with defective 
remote contro! system. 
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Chapter 1, Section 2 


1. AX B= ((a,0)\a € [0,1)) y 
U ((a,3)}a € [0,1}) 
U ((a,5)la € [0,1}). Hence 5 
the graph consists of the 
three strips as shown. 3 


0; 1 
Because (0,3) € A X B, but (0,3) ¢ B X A, we have A X B* BX A. 

. First part obvious. Second part not necessarily true. Take A,B * 0, X= @, YQ B. 

. Define x ~ yiffx — yisa multiple of 4. ~ is an equivalence relation on Z corresponding 

to the given partition. 

7. By (ii) x R y and y R y= y Rx. So the relation is symmetric. Then by (ii) and the 
symmetric property, x R yand yR z=ezRx=™x R z, proving transitivity. The con- 
verse is proved similarly. 

9. ~ is an equivalence relation, and Z/~ = (...,—3,— 1,1,3,...) U (...,.—4,—2,0,2,4....). 


A Ww 


Chapter, 1, Section 3 
1. (a) Let a,,a,€ A. Then 
(gf Xa.) = (gf Naz) = g(f(a,)) = g(f(az)) => f(a,) = f(a,) = a, = a. 
3. Let A -£& B 4 C. Suppose fg is injective. Then g must be injective. For 


g(a,) = g(a) = (fgXa,) = (/BXaz) = a, = a. 


But/ need not be injective. For take A = Z*, the set of positive integers, B = C = Z, and 
let f(x) = x2 and g(x) = — x. Then /gis injective, but / is not. Similarly, if fg is surjective, 
then fis surjective, but g need not be. 

5. By Theorem 3.5, g injective (or f surjective) =° g has a left inverse (or f has a right 
inverse) =* g(or f) is invertible, since gf = I,. 

7. Let A= (aj,....Am), B= (0;,...,.5,), and let {: A —> B. Suppose m <n. The possible 
choices for f(a,) are n. Having fixed f(a,), the possible choices for f(a,) aren — 1, and so 
on. Thus, there are n(n — 1) °° (n — m + 1) possible injective mappings from A to B. 
When m > n, the elements of B are exhausted before we have assigned an image to each 
element of A (since fis injective, no two distinct elements can have the same image). 
Therefore, there is no injective mapping from A to B when m > n. 

9. n". Among n", n! are injective and, hence, bijective (Theorem 3.3). 


Chapter 1, Section 4 


1. (a) (2¢3)¢4 = 12¢4 = $76. 2°(3 ° 4) = 2° 36 = 144. So this operation is not asso- 
ciative. Also 2 °3 = 12,3¢2 = 18; hence, 2° 3 % 3 « 2. Thus, eis not commutative 
either. 

(b) x°(yz) = x°(min(y,z)) = min(x,y,z). Similarly, (x y)« z= min(x,y,2). 
Hence, this operation is associative; it is also commutative. 

(c) This is commutative but not associative. 

(d) This is both commutative and associative. 
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3. (xo yplozm(xeaeyloz=xearyeqez= xo(yo z). Further, 
XOPBXeAepreKeyog=s=yexeg=yegqexmypoy, 


5. First, the total number of binary operations is the number of mappings from a set with 
n? elements to a set with n elements, which is n”. Among these the commutative ones 
are 


OP AN2+ 9 ae alate iy? 


Chapter 2, Section 1 
1. (a) Let P(n) = 12 + 37+ «>» +(2n— 1). We want to show 


’ 
P(n) = ae I) 


Clearly, P(n) holds for n = 1. Assume P(n) holds for n = k. 
Pik +1) = 12+ 32+ + (2k — 1)? + (2k +1)? 


k(4k? — 1) 
3 


= P(k) + (2k +1) = + (2k + 1)? 


= (k + 14k? + 8k + 3) a (k + 1X 4(k + 1)? — 1) 
3 3 : 
so P(n) holds for n = k + t. This proves P(n) holds for all n. 
(b) Similar. 
3. (a) Trivial for n = 1,2. Assume for the case n = k. Consider 


(\4)'~((4) ra) = (Aa) 


(by the case n = 2) 


k 
= (U Ai) U Abe (by induction hypothesis) 
(=) 


k+l 


a Ai, 
im] 


showing that the result is true for n = k + 1 and, hence, for all 7. 
(b) Similar. 
5. (a) Let dd’ be 1.c.m. of a and 6. Then ajd, bjd and ald’, bjd’.Thus, (ii) gives dla’. 
Similarly, d’|d. Thus, d= da’ because d and d’ are positive integers. 
(b) Suppose d= pf --- pf and g, = max(e,, f/;). Then ald and djd. Further, if ajx and 
b\x, then x = ay = bz for some y,z € Z. Clearly, pf, where g, = max(e,, f;), must 
divide x, 1 s isk. Hence, d|x. Thus, d = [a,5]. 
(c) First observe that (a,b) = pf: --- pi‘, where h, = min(e,, f,). Using this fact gives 
ab = [a,b}(a,6). 
7. Let a/b be a rational solution. Then 


a am} 
a4 62 +o +S + a, 0, 


This implies 


a 
ajar" + + + b"7a,_,a + ba, =~. 
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But then a*/b € Z; that is, bla. Hence, a/b is an integer. Further, if a is an integral 
solution, then 
a(a*-' + aa*-2 + «>>» +4,_,)=—a, ala,. 


9.am1’ (mod p), 2a™2’ (mod p),...(p—1)a™(p—1) (mod p), where (1’, 
2’,...(p — 1)’) = (1, 2,...,(p — 1)) and (a,p) = 1. Thus, 


(p— 1)ta*-"=(p— 1)! (mod p), 
so a?~' = | (mod p). Therefore, a? = a (mod p). 


Chapter 3, Section 5 


1. Suppose A isan m X n upper triangular matrix, so that a,, = 0 whenever i > j. For every 
o€S,,0% e, there is some i such that o(i) < i. Then @,,,,, = 0. Hence, 


det A= ¥ Edin eee Bngny ™ 21:87 “7° A. 


3. Using the definition gives 


!1 a a 
1 5b 62) = bc? + ab? + atc — a2b — b’c — ac? 
1 c c?}=(a—bXb—cXc— a). 


5. det A = det A? = (det A)? (Theorem 5.4)=det A =0 or 1. 


Chapter 3, Section 6 


(on subtracting the first row 


1 a a? 
from every other row) 


0 b-a b'-a} 
0 c-a ci-a 
b-a b-a (on expanding according to the first 
c-—a ci-@ column) 
1 6? +a*+ab 
=(OaKe=a) 1 c?+a*+ac 
= (b— aXc — aXc — bXa + b+ 0). 
(b+c? a? a? 
b? (c + a)? a 
c? c? (a+b)? 
(o+c)? a®?—-(b+c)? a?—(b +c)? 
Bb 8 (c+a)?— 6 0 
Cc 0 (a+ bP? —c? 
(6+0c)? a-—b-c a-b-c 
b? cta-b 0 
c? 0 at+b-c 
2bc —2¢ —2b 
Bb cta-—b 0 
c? 0 a+b-c 


laa 
1 6b B 
lece 


3.(a) 


(on subtracting the first column 
from the other two columns) 


(on taking out the common 
factor (a + b + c) in the 
second and the third columns) 


(on subtracting the second and 
third rows from the first row) 


=(a+b+c) 


=(a+b+c) 
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2 | 2c —2bc —2bc (on multiplying the second 
= ator b? b(c+a-—b) 0 and third columns by 5 
c? 0 c(la+b-—c) and c, respectively) 
1 —! — | (on taking out the common 
=2(atb+c)?|b? b(c+a-—b) 0 factor 2bc in the first row) 
c? 0 cla+b-c) 
I 0 0 (on adding the first column to the 
=2(a+b+c)? |b? b(c+a) b? other two columns) 
c? c? c(a + b) 


= 2(a + b + c)?(b(c + a)c(a + b) — bc?) 


= 2abc(a + 6 +c). 
a-b-c 2a 2a 
3.(b) 26 b-—c-—a 2b 
2¢ 2c c-~a-—b 
at+b+c atb+c atbt+c (on adding the second and third rows 
a 2b b-c-—a 26 to the first) 
2c 2c c—a-—b 
1 1 1 
=(a+b+c){2b b-c-—a 26 
2c 2c c-~a-b 
| 0 0 
=(a+b+c)|2b —b-—c-—a 0 
2c 0 —-c-a-—b 
=(a+b+c)°. 
Chapter 4, Section 1 


1. Because addition in R is associative, 
(f(x) + B(x) + A(x) = f(x) + (g(x) + A(x) 
for all x € R and all mappings, f,¢,4 € A. The mapping given by x — 0 for all x € R is 
clearly the identity in A. Finally, each mapping /: R — R hasan inverse f~ € A given by 
S-(x) = —f(x) for all x E R. 

3. For all a,b in G, 
ab = aeb = a(ab)*b = aababb = ebae = ba. 


5. Writing 


e=(0 th e-(0 1) e=(o th Co -1) 


we easily verify that a2 = b?=c?=e, ab=c= ba, be=a=cb, and ca=b= ac. 
Hence, G = (e,a,b,c) is closed under multiplication, e is the identity, and each element 
is its own inverse. Because matrix multiplication is associative, G is a group. 

7. Leta & G. Theset(a'|i € N) C Gis finite. Hence, a! = a/ for some positive integer i < j. 
Choose a positive integer p such that i < p(j — i), and set e = a®%J-, Then ea! = a’. 
Hence, 


e? =e a! e QHs- tt = a! e Qhi--t = ¢, 
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9. There exist integers x and y such that xm + yn = 1. Hence, for all a,b in G, 


ah" = (a™b")xmtyn 
= a™(b"qm)xmtyn— ipa 
= a™(b"a™)*™(b"a™)—*"(b"a™)"b" 
oc (b"a™)*™a™(b*a™)—'b"(b"a™)”" 
= b"Q™. 


Thus, every mth power commutes with every nth power. Hence, 


ab = qxmtynprmt yn = (a*)™(a”)"(b*)™(b”)" — ba. 


Chapter 4, Section 2 


l. 


3. 


5. 


G = (ek = 0,1,...,2 — 1). Let f: Z/(n) > G be defined by f(k) = e%/", fis well 
defined and is an isomorphism. 

f(x) = x? =>(xy)? = f(xy) = f(x) ((y) = x?y?. Thus yx = xy. 

Since the order of the image of an element under any homomorphism divides the order 
of the element, elements of order 2 in S, must map to 0 in Z,. 


Chapter 4, Section 3 


Ll; 


3. 
>. 


Ha=H=>a=ea€ Ha a€éE H. Conversely, if a € H, then forevery hin H, hae H 
and h = ha~'a € Ha. Hence, Ha = H. 

The mapping f: H-+x~'Hx given by f(a) = x 'ax is bijective 

Let H<G, aé&G. For every h in H, (ah)! = h-'a-! € Ha™ and ha! = (ah™')“'. 
Hence, Ha! consists of inverses of all elements of aH. 


. (a) |HK{ 1s a divisor of |H| as well as of | K|. Hence, |HAK| = 1. 


(b) |HAK| divides |H] = p. Thus HA K = {e}, or HAK = H which implies H < K. 


. Suppose A = aH = bK for some subgroups H,K of G. Then a € bK. Hence, a = bk for 


some k € K. Therefore, a~'b = k—' € K. Hence, H = a~'(aH) = a~'bK = K. Again, 
A=aH = A= (aHa™')a. Hence, A is a right coset of the subgroup aHa™'. 


. Suppose o(ab) = m. Then (ba)™ = a~'a(ba)™ = a~'(ab)"a = e. Hence, o(ba)|m. Simi- 


larly, o(ab)\o(ba). 


. Because o(x~'ax) = o(a), it follows that x~'ax = a for every x € G. Hence, a € Z(G). 


Further, if n > 2, there exists a positive integer m (> 1) relatively prime to n. Then 
o(a™) = o(a), which contradicts the hypothesis that a is the only element of order 7. 


. Because x? # e => x * x~!, it follows that there are an even number of elements x 


satisfying x? # e. Hence, there are an even number of elements x such that x? = e. 
Because e is one of these, there are an odd number of elements of order 2. 


. Let ofa”) = k. Then a” = e, Hence, rjmk. But (r,m) = 1. Hence, rik. On the other 


hand, (a”)' = (a’)™ = e. Hence, kjr. 


. Let o(a) = m and 0o(g(a)) = k. Then 


(g(a))™ = g(a) ++ g(a) = g(a”) = g(e) =e’. 


Hence, k|m. Further, if g is injective, then g(a“) == (g(a))* = e’ implies a* = e. There- 
fore, mik. 
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21. Clearly, e € C(S). If x,y © C(S), then 
xy !s =m xy" !syy"! = xy" lysy—! = xsy—! = sxyW! 
for all s € S. Hence, xy~' € C(S). C(G) is the center of G. 


Chapter 4, Section 4 


1. The roots of x* = 1! are Pe la td where k =0,1,...,2 — 1. These roots form a 
n n 


2 2 
group generated by w =cos = + isin— (see Problem 1, Section 2). 
n n 


. G=(Z,+). If a € Z generates (Z,+), then | = ma for some mE Z, soa= +1. 

5. (Only if) Let aE G, a * e. Then G = [a]. Now apply Theorem 4.4. 

7. Since S, is not abelian, there is no nonzero homomorphism from §, onto a cyclic group 
of order 6; nor onto a cyclic group of order 3 (see Problem 5, Section 2). There is only 
one nonzero homomorphism from S, = {e, a, a,b, ab,a*b} onto Z;, given by a0 and 


b— 1. 
9. Z, x Z, is isomorphic to the Klein four-group, which is not cyclic. 


Cod 


Chapter 4, Section 5 


1. The only proper subgroups of S, are the cyclic subgroups generated by o = (1 2 3), 
t, = (2 3), t2 = (1 3), and t, = (1 2). 

3. Using the notation in the text, we get o't = t,,, and t0!= 1,,,_,. Hence, o',t com- 
mute if and only if i= 0 or 2/ =n. If nis even, Z ™ (e,0°%?). 


Chapter 5, Section 1 
1. For all ae Z(G), xEG, 
xax-' = axx-'=@E Z(G). 
3. ForallnE N, mEM,xEG, 
x(nm)x—" = (xnx—'!\xmx-') € NM. 


5. The cyclic subgroup [b] generated by b=(; is of order 2, and a~ 'ba¢[b}, 


where a=( 5 ). 
23 1 


. For all heH, xeG, 
Xhx~! = (xh)2h-"(x')? € H. 
Hence, H 4G. Moreover, for every coset xH € G/H, (xH\xH) = x*H = H. Hence, 


G/H is abelian. 
9. N is a subgroup of index 2 in S, and, hence, normal. S,/N = {N, tN}, where tN = {(2 3), 


(3 1), (1 2)). 
11. det(B) = 1 => det(A~ ' BA) = det(A ~ ')det(B) det(A) = det B= I. 


13. Clearly, O(a/b + Z) divides b. 


~ 
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15. G/Z(G) is cyclic, hence G is abelian (see Example 1 .5(h)), which contradicts |G/Z(G)| = 37. 

17. In the notation of Theorem 5.2, Chapter 4, choose 
A=[o), B=[t]. 

19. The set of all normal subgroups of G is a lattice under partial ordering by inclusion, 
with H v K = subgroup generated by H and K = HK and H a K = HoiK. Since H,K 
are normal subgroups, HK is also normal. Furthermore, if H&L, then H(KQL) = 
HK OL. The lattice of all subgroups of G will not, in general, be modular. Take G = D, = 
{e, a, a?, a°, b, ab, a?b, a°b}, the dihedral group, L = [a], H =[a?], and K =[b). Then 
H(KQNL) # HKOL. 


Chapter 5, Section 2 


1. (o(a))" = o(a) -*- ofa) = ofa"). Because o is injective, o(a") = e’ = o(e) holds if and 
only if a" =e. 

3. If Gis generated by a, consider d: G —» Z,, given by a!» r(i) = remainder of i modulo 
4, 

5. Consider ¢: D, — Z, given by o/- 0, ot 1 for all i. 


Chapter 5, Section 3 


1. Aut(Z) = Z). 

3. Aut(K) = S;. 

5. Verify that the indicated homomorphisms are onto. 

7, By invoking Theorem 2.1, Chapter 8, the given abelian group A = Z,, x Z,, x K where 
n,|n, and K is some suitable abelian group. Let Hom(A, A) denote the set of all homomor- 
phisms of any abelian group A. It can be directly vertified that if A= Z,, x Z,, x K. 
then as additive groups 

Hom(Z,,,Z,,) Hom(Z,,,Z,,) Hom(K, Z,,) 
Hom(A, A)~} Hom(Z,,,Z,,) Hom(Z,,,Z,,) Hom(K,Z,,) | = M(say), 
Hom(Z,,,K) Hom(Z,,,K) Hom(K,K) 
a; 
acting on A, whose elements are viewed as | a, [,a,EZ,,,a,€Z,,, @,€K. Indeed 
ay 
Aut(A) = Aut(M). Choosing ceAut(Z,,), de Hom(Z,,, Z,,) suitably one can produce two 
automorphisms a, Be Aut(A) given by 


o 6 0 1 6 0 
.-[0 1 | -| 1 | 

0 0 1 00 1 
such that af x Ba. 


Chapter 5, Section 4 


1. xEZ(G)<o/C(x)] = I. 
3. (e), ((2 3), (3 1), (1 2)), ((1 2 3), (1 3 2)). 


446 
5. 


~~ 


oO 


: N <G is clear. The orbit of (|) consists of ( \( 
0 0 a™! 
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By Theorem4.4 there is a homomorphism ¢:G-~S, with Ker@=N= 
N.egxHx7' C H. Now NG and G/N = Im OC S,. Hence, G/N is of finite order. 


. Let x€ Gand o{x)§NI. [x]N/N < G/N, so |[x]N/N| divides G/N. But then (|[xJN/N|, 


|N|) = 1. From Example 3.1 1(e) in Chapter 4,(|[x]/INO [x], (NI) = 1. But since 
(x) = j[x]] divides |N}, o(x) must divide | NM [x]]. This yields [x] = NN [x], so x € N. 


. If bis a conjugate of a, then 


b= xax—' = (xax—'a7')a E G'a. 


Hence, C(a) C Ga. 


. G acts on N under conjugation. Since N ¢ Z(G), there exists an element néeN such that 


1 <{C(n)| < 3, since N = |)C(n). Thus |C(n)| = 2 and hence [G:G,} = 2, where G, is the 
stabilizer (= normalhizer) of n. 


X,, =4, and X,,, = 4. Thus by the Burnside Theorem the number of orbits = !(24) = 4. 
The four orbits are precisely 


2 2 2 2 2 2 
i 302 Nj, 1——3| ax, ee 


Ash awl? a} 


. Let X be the set of all possible necklaces. As in Example (c), |X| = 2° and Z, acts on 


X. We compute X, forall geZ,. Writing Z, = (1, 2, 3, 4, 5, 6 =0} and beads a,,a2,...,d, 
the action of 2 will transform the beads (in this order) 4, 424304450¢ (0 €304050,0, 4). 
In order that these are the same necklaces, the beads a,,a,,a, must be of one color 
and a;,@,,a, must also be of one color. Thus the number of choices = 2:2 = 4. So |X| = 4. 
Similarly the action of 3 yields that beads a,,a, must be of the same color, beads a,,a, 
must be of the same color, and beads a,,a, must also be of the same color; and this yields 
X,=8. Similarly X, = 2, X, =4, X, =2, and X, = 2°. Thus by the Burnside Theorem, 
the number of different necklaces = 1(84) = 14. 


. Because the top and the bottom of the neckties are interchangeable, the neckties 


A 
and 

i Ee 

in| ae 


are the same. Let X be the set of all ordered sets (a,,a),...,a,) of n colored strips and 
. 3. ts 

G be the subgroup of S, generated by the element a = ") of order 2. Then 

n 


n—J- 
X is a G-set and the number of distinct neckties is the number of orbits. Let n = 2m; 
then x =(1 n)(2 n—1)-- pi m + 1). The ordered set (a,,...,a,) is fixed by a if and only if 
Ay = Ay, Oy =A, 14-0 Om . Thus X, =k*® =k"?, Thus the number of distinct 
neckties = (k* + 2), A atne argument yields the desired result for the case when n 


is odd. 
b ! 


0 
a,,@, = 1,2, or at. 


), a = 1,2, or 4. Thus the orbit 


*((o)(o))*{((o)-C)} 
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Chapter 6, Section 1 


1. 


The quaternion group Q has three composition series of the form (e) C (e,—e} C 
(e,— e,x,— x} C Q, where x is 


i. tee he 
0 -vTy t-1 of “ Lr of 


. (e} C (e,a)} C (e,a,b,ab) = G. 
. (0) C [15] C [S] C G, (0) C [15] C [3] C G, 


(0) C [10] Cc [5] C G, (0) C [10] C [2] CG, 
(0} C [6] C [3] C G, (0) C [6] C [2] CG, 
where G = Z/(30), and [m] denotes the cyclic subgroup generated by m. 


. LetNg=N2N,2>N,2> -- > {e} bea composition series of N and Go/N = G/N >G,/N 


>---N/N = {eN} be a composition series of G/N. Then G>G,>:-- >N2N,>°: > 
{e} is a desired series. 


. If any abelian group A has a finite composition series Ag = A>A,>A,>-°--->{e}= 


A,+,. then since A;/A,;,, 1S a simple abelian group, |A;/A;.,| =p, for some prime p;. 
This gives | A| = pop, -::p,. where p, are primes. Thus, Z has no finite composition series. 

Clearly Z > 2Z327Z352°Z25--- and Z33Z>53°Z > 3°Z BD --- are infinite composi- 
tion series and the Jordan-Holder theorem fails. 


Chapter 6, Section 2 


3: 


lab 10b 100 
o=6.-4[01¢| atceR} 6, 010 || beR}, and G, = | 010 | = {e}. 
001 001 001 


Then G, AG, and G/G, and G,/G, are abelian. Thus G is solvable. 
Let G=A x B. Clearly G/{{e} x B) ~ A. Since A and B are solvable, it follows that G is 
solvable. 


Chapter 6, Section 3 


l. 


3. 


5. 


S, has a trivial center, if n = 3, and hence is not nilpotent. For if Z(G) = (e), then 
Z(G) = (e) for all i. 

S, has a normal subgroup N = (e,(1 2 3),(1 3 2)), where N and S;/N are both abelian 
and, hence, nilpotent. But S; is not nilpotent. 

Assume the result is false. Let G be a group of minimal order such that G is nilpotent, 
|G| = p*s, (p,s)=1, and pf|Z(G)|. Because |Z(G)| #1, |G/Z(G)|<|G|. Also, G/Z(G) 
is a nilpotent group, and pdivides|G/Z(G )| because p ¥|Z(G)|. Thus, by the minimality 
assumption p divides Z(G/Z(G)) = Z,(G)/Z(G). Let x = xZ(G) € Z,(G)/Z(G) be of 
order p. Clearly, pjo(x). By replacing x by a suitable power of itself, we may assume that 
x itself is of order p. Now y = x7'g7'xg € Z(G) for 2:1 g € G. Because x ¢ Z(G), there 
exists g € G such that x~'g-'xg * e. We note that p does not divide o(x~'g~'xg), 
because p does not divide {Z(G)j. Further, x~'g~'xge Z(G) implies 
o(x + x~'g~' xg) = o(x}o(x~'g~'xg) [Example 3.11(d), Chapter 4]. Thus, o(g7'xg) = 
o{x)o(x—'g-' xg). But o(x) = o(g~'xg). Hence, o(x~'g7 'xg) = 1; that is, x7 'g7'xg = e, 
a contradiction. 
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Chapter 7, Section 1 


1. If an index { occurs in @ only, then af(i) = a(i) = Bafi). If i occurs in neither @ nor 8, 
then af(i) = i = Bari). 

3. For any cycle y, o{y) = length of y. Let o = y, °*: y, be a cycle decomposition, with 
Ay,) = m,, i= 1,...,&. Let o(6) = m, and I.c.m.(m,,...,m,) = 1. Then 


emote yt os yh me yt = e me m,|m, f= 1,..,K. 
Hence, /|m. On the other hand, o/ = y/ --» yf = e, Hence, mil. 


Chapter 7, Section 3 


1. Any transposition (i/) € S, can be decomposed as (ij) = (in\_jnXin). Because every 
permutation is a product of transpositions, it follows that S, is generated by the set (17), 
(2n),....(n — In). Further, it is easily verified that for any i, | sisn— 1, 


(in) =(12---n— 1)7 79 -n — Lal 2---n— It 


Hence, S, is generated by (n — 1 n) and (1 2---1 — 1). 

3. Let H be a group of order 4n + 2. Then H < S,,,4, (Theorem 5.1, Chapter 4) by 
identifying each a € H with the corresponding permutation /, given by left multiplica- 
tion by a. We claim that H contains an odd permutation; that is, H ¢ A,,42. Now by 
Cauchy's theorem 3a € H such that o({a) = 2. Regarding H as an [a}-set, we obtain 
H = U e4(x,ax) as a disjoint union of orbits. Thus, a = (x, ax,)°*+ (x, @x,), a product 
of k = 2n + 1 disjoint transpositions. Therefore, H contains an odd permutation, as 
claimed. To complete the solution, we note that HAgns2 @ Aans2) $0 Sang: ™ HAagns2: 
This implies 
Sane? = HA gns2 _ H 
Agn+2 Aan+2 Aans2 OH 
proving that [H: A,,4;9 H) = 2. 
7x 6 §x4x3 

2 
420. Now |C(x)| = 


= 420. Thus the number of elements in the conjugate class of x =(ab)(cde) = 


|S] 
|N(x)| 
7. The following are six possible positions by rigid motions with vertex 1 fixed: 


yields | N4(x)] = 12. 
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Pieaea eerie : 

a= » a =e, 
148 5237 6 
34567 8 

b=(1 6 ), tae 
143258 7 6 


ab = ba?. 
This proves /? ~ S,. Clearly, (G| == 8 6 = 48. 


. By Cayley’s Theorem G < S, for some n> 0. Define a mapping {:5,—S,,, given by 


f(c) = 0, if o is an even permutation, and {(o) = ot, otherwise, where t=(n+1 n+ 2) 
is a transposition f{ is a monomorphism. Thus, S, is embeddable in A,, 5. 


Chapter 8, Section 1 


. Clearly, H and K are subgroups of Z/(10) such that HO K = (0) and 


H+ K = (0,2,4,6,8,5,7,9,11,13) = (0,1,2,3,4,5,6,....9) = Z/(10). 


. Suppose, if possible, Z/(8) = #7 © K, where H and K are nontrivial subgroups of Z/(8). 


Then one of the subgroups must be of order 2, and the other subgroup must be of order 
4. Suppose |H| = 2 and |K|= 4. The only subgroup of order 2 in Z/(8) is (0, 4). Thus, 
H = (0,4). Now 2 € Z/(8) implies 2 = a + b, aGH, bEK. Ifa=0, then 2€ K, so 
4=2+2€ K, a contradiction because HN K=0. If a= 4, then b= —2€ K, and 
again we get a contradiction. Hence, Z/(8) cannot be decomposed as a direct sum of two 
nontrivial subgroups. 


Chapter 8, Section 5 


1. 


3. 


The mapping o: G — K defined by o(a,b) = 6, (a,b) € H X K = Gisahomomorphism 
of G onto K. Ker o = H X (e). Thus, G/H X (e) = K 

Suppose, if possible, Q= H@K. Let OF a/bEH, O#c/dE K. Then ac= 
(bcXa/b) € H and ac = (adXc/d) € K. Hence, HN K # (0), a contradiction. Thus, Q 
cannot be decomposed as a direct sum of two nontrivial subgroups. 


. By Theorem 3.2 and by hypothesis A = A, © +: © A,,, where|A,| = p,. Hence, each A, 


is a cyclic group of order p,, | Ss is m. But then A is cyclic of order p, -°: p,,. 


. By Theorem 3.1,A = A, @ -*: © A, isadirect sum of cyclic groups A,, where|A,|= ™, 


and m,|m,| +> (my. If k= 1, A is cyclic, so k > 1. Let pjm,. Then pjm,. Thus, by 
Cauchy's theorem A, and 4, contain subgroups B, and B,, respectively, each of order p. 
Then B = B, © B, is a subgroup of type (p. p) of A. 


9. Use the first Sylow theorem and the result that a subgroup of index 2 is normal. 


. 200 = 23 - 52. The number of Sylow 5-subgroups is 1 + 5k and | + 5k{200. This im- 


plies k = 0. Hence, there is a unique Sylow 5-subgroup that must be normal. 


. There are | + 7k Sylow 7-subgroups. Also (1 + 7k)|42 implies k = 0. Hence, there is a 


unique Sylow 7-subgroup. 


. Let|G| = pf'-:: pf. Let H,,...,H, be Sylow p-subgroups of G. Because HH, 4 G, each H, 


is the unique Sylow p-subgroup of G. Clearly, H,QM}.,, jo, H, = (e). Thus, by 
Theorem 1.1, G is the direct product of H,,...,H,. 


. If |G/Z(G)| = 77 = 11-7, then by Example 4.7(f), G/Z(G) is cyclic. Hence, G is abelian, 


a contradiction. 
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19. We know |Z(G)|> 1. If |G/Z(G)|= 1, p, then G is abelian, a contradiction. Thus, 
|G/Z(G)| = p?, so|Z(G)| = p. Further, 


|G/Z(G)| = p? => G/Z(G) is abelian = G’ C Z(G) = G’ = Z(G). 


21. Let HS>NONP, and let xeN(H). Now P and x~!Px are Sylow p-subgroups in H. 
Thus, there exists y © H such that P = y~'x~'Pxy. This implies xy € MP) C H, so 
xE H. 

23. First we show that if H is a proper subgroup of nilpotent group G, then M(H) * H. Let n 
be maximal such that Z, ¢ H. Choose x € Z,,,, x & H. Let hE H. Then x hx = 
h(h-'x—'hx) € HZ, = H, so x € MH). This proves our assertion. Now, let P be a 
Sylow p-subgroup. Suppose P is not normal in G; that is, M(P) * G. Then P is a proper 
subgroup of G, so by the assertion in the first sentence, M(P) # P. Using the same fact 
again for the proper subgroup MP), we have M(M(P)) * N(P), acontradiction (Problem 
21). This proves that every Sylow p-subgroup is normal. The “only if” part is completed 
by Problem 15. The converse follows from the fact that a finite p-group is nilpotent, and 
a finite direct product of nilpotent groups is nilpotent. 

25. P< MP), soHNP<HN MP). Also, (HM NM P))P < G. This implies (HN N(P))P = 
P, because P is Sylow p-subgroup. Then 


. (HOMP)P HON ?P) _ HON?) 
Y P HNMP)NP HNP 
=> HANP)=HOP. 


27. Let L be a Sylow p-subgroup of K. Then Lc M, where M is a Sylow p-subgroup of G. 
Thus, Lc M=x~'Px for some xeG. But x =hk, heH, keK. So Lo k7'h7' Phk=> 
kLk>' ch! Ph=>kLk~'ch-'Phok. But then kLk~! =(h7Ph)NK. 


Chapter 9, Section 4 
1. (a) 


(feXx) = S(x)g (x) = g (x) f(x) = (ef Xx) = fg = gf. 
(f1)x =f(x)l = f(x) > fl =f 


(b) Let 


0, Osxsi, 
foy= {oy fees) 


x-4, Osxs}, 

so= {3 ¢sxsl. 
Then, f.geS, f #0, g #0, but fg =0. 

(c) Let f= f? € Sand let a € [0,1]. Then either f(a) = 0 or f(a) = 1. Suppose f * 0 and 
fel. Let A=(xe€ [0,1] f(x) = 0), and let B= (x € [0,1]| f() = 1). Then AU 
B= (0,1),A * O, B* ©, A * [0,1], and B * [0,1]. From real analysis, A and Bare 
closed; also, they are bounded. Because AM B = ©, we may assume that 1.u.b. 
A*1, Let p=1.u.b.A. Then, clearly, f is not continuous at p, a contradiction. 
Thus, either f= 0 or f= 1. 

(d) Let f,, ET. Then (f/, —f,X@) =/\(a) — f(a) = 0. Thus, f, —f, € T. Similarly, 
Kf, € T. Therefore, T is a subring. The last statement is clear. 


Solutions to odd-numbered problems 451 


3. (a) (i) Leta+ bv=1, c+ dV—-1€ A. Then 


(a + bV=1) — (c+ dV=1) = (a— 0) +(b-— d) V-I1 EA. 
Also, 
(a + bV=1Xc + dV=1) = (ac — bd) + (ad + bc) V-1 E A. 


Hence, A is a subring. 
(ii) Let a + bV—3,c + dV—3 € B. Then 


(a + bV—3) —(c + dV—3) = (a—c) + (6 — d) V—3. 


Case |. If a,b,c,d € Z, then (a — c),(6 — d) € Z. 

Case ll. If a,b € Zand c,d are halves of odd integers, then both a — cand b — d 
are halves of odd in . Proceeding like this, we get in all possible cases 
(a + bV—3) — (c + dvV—3) € B. Next, 

(a + bV—3X(c + dV=3) = (ac — 3bd) + (ad + bc) V-3. 


If a,b,c,d € Z, then (a + 6 V—3Xc + dv—3) € B. Suppose a,b € Zand c,dare 
halves of odd integers. Then ac — 3bd, ad + bc € Z, ifa and b are both even or 
both odd integers. But ac — 3bd and ad + bc are halves of odd integers if a is 
odd, 5 is even, or vice versa. Proceeding in this manner, we get in each case that 
(a + bV—3Xc + dv—3) € B. Hence, B is a subring of C. 
(b) Because eae — ebe = e(a — be € eRe and (eaeXebe) = e(aeb)e € eRe, eRe is a 
tee It is clear that e isa Se, 


vertible elements. 

(b) Let 2,6 €(U(R),:). Then (ab)-' = b-'a~', so ab € U(R). Because (R,-) is a semi- 
group, (U(R), :) becomes a group. _ 

(c) xE Z/(n) invertible = 3y € Z/(n) such that xy= 1 => 1 —xy=nk for some 
integer k=el =xy+nk=(x,n)=1. Conversely, (x,n) = 1 = 3a,bE Z 
such that | =xa+nb=—1=xa+nb=xa=<x is invertible. Let U(R)= 
(x € Z/(n)\(x,n) = 1). Then (U(R), *) is a multiplicative group of order ¢(n). Thus 
XM") om; that is, x) = | (mod n). 


. Leta™ =0, 6" = 0. Let k = max(m,n). Then 


0403 (*)oroao 


Next, if R= F,, the 2 X 2 matrix ring over a field, and if a = (8 }), b = (9 8), then 
a? =( = >? but a + bis not nilpotent; in fact, a + b is invertible. 


. Let R be an integral domain with characteristic n * 0. Suppose n = pm, p,m <n. Then 


there exists a € R such that pa # 0. Let x € R. Then ( paXmx) = (pmax) = n(ax) = 0 
implies mx = 0 for all x € R, a contradiction, because mm < n. Hence, n must be prime. 


. Let R = (0,a,,...,4,) be an integral domain. Let a, € R. Then @,@,,...,4,a, are all non- 


zero and are distinct elements of R. Thus, given a, € R, there exists a,€ R such that 
a,a,= a,, that is, the equation ax = 5 has a solution for all nonzero a € R and for all 
nonzero 6 € R. Also, if b = 0, then, clearly, x = 0 is a solution. Hence, by Example 
4.4(d), R is a division ring. 


. aua= a= quab=ab=au= 1 ua=—! 


=uab=bmy=bmba= |}, 


. LetO* aE Rand bE R. Then there exists c € R such that ac = e. This implies acb = 


eb = b. But then ax = bhasa solution. Hence, R isa division ring [see Example 4.4(d)}. 
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17. 


21. 
23. 


25. 


Let (ab)™ = 0. Then (ba)"*! = b(ab)"a = 0. 
Let ab + c — abc = 0, Then it is straightforward to verify that 


ba + (— ba + bca) — ba(— ba + dca) = 0. 


. Letat+c-—ac=0. Then 


ba + be — bac = 0 = b + bc — bc =0=>5=0. 


Let e + b— eb = 0. Then e? + eb — 2b = 0 = ce? = 0; that is, e = 0. 

Let e € R be such that e is not r.q.r. Clearly, e # 0. If e? is r.q.r., then e is also r.q.r. 
(Problem 20). Thus, e? is not r.q.r. and, hence, e = e?. Leta € R. Suppose ea * a. Then 
a—eate*e and so ts r.q.r. Thus, 35€ R such that a—- ea t+e+b—-—(a-—eat 
e)b = 0. Multiplying on the left by e and using the fact that e = e?, we obtain e = 0, a 
contradiction. Therefore, a = ea for all a € R. Further, if ae * a, then ae~ ate*e; 
so there exists cE R such that (ae — a + e) + c— (ae— a + ebc = 0. Because ¢ is a left 
identity as already shown, we obtain ae — a + e + c — ac + ac — c = 0; in other words, 
ae—a+e=0. This yields, by multiplying by e on the right, e = 0, a contradiction. 
Hence, e is the identity of R. 

Now, let0* aE R. Then e—a*¥e, so there exists bE R such that e—a+b— 

(e — a)b = 0; that is, e — a + ab = 0. This implies a(e — 6) = e. Hence, each nonzero 
element has a right inverse. Therefore, R is a division ring [you may use Example 
4.4(d)). 
Let x=(a,,0,....0), »=(0,a3,0,....0)E R,© °° OR,, where 0%a,ER,, 0% 
a, € R,. Then xy = 0, but x # 0 and y * 0. If a™ = 0, then a? = 0, 1 s isn. Con- 
versely, if a= 0, 1 s issn, then by choosing m= max(m,,...,m,), it follows that 
a" = (0. 


Chapter 10, Section 1 


3. 


>: 


7, 


LetO*# aE R. Then @R is a nonzero ideal, so aR = R, which proves that ab = | for 
some bE R. 

Let 0O* aE R. Then aR is a nonzero mght ideal, so aR = R; that is, every nonzero 
element has a right inverse, proving that 2 ts a division nng. 

As remarked just after the definition of a PID, any ideal in F[x] is of the form (/(x)), 
J(x) € F[x]. 

First we show that an element f(x) = 27.,4a,x‘is invertible in F[[{x]] iffa, * 0. If f(x) is 
invertible with inverse g(x) = If55,x', then, clearly, dgby = 1, $0 do * 0. Conversely, 
if ag * 0, we show that there exists g(x) = 27.,5,x' that is the inverse of f(x). Tenta- 
tively, we assume {(x)g(x) = | and obtain the following equations: 


Agd) = |, 
apd, + abo = 0, 
ayb, + a,b, + aby = 0, 


aob, + a,b,_, +... + a,by = 0, 


Because a, ¥ 0, this system of equations determines 5) ,5, ,b,, and so on. Having deter- 
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mined 5,,,,...,5,-;, we can find 5, from the last equation. Thus, by induction on a, 
each 5, can be determined. This proves /(x) is invertible with g(x) as its inverse. 

Let A be a nonzero ideal, and let S = (m = Oja,, x” + a,4,x"*!' + +++ EA, and 
a,,*# 0). Obviously S*#@. Let m be the smallest positive integer in S. Then 
X™(a,, + Quy Xt °**)EA=OX™EA. Thus, A = (x”). 


Chapter 10, Section 2 


3. 


Ideals in Z/(n) are of the form (a) + (n)/(n), a © Z (Theorem 2.6). Because (a) + (1) = 
(d), where (a,n) = d, it follows that the ideals in Z/(n) are (d)/(n), where djn. 

Let f: Z/(m) — Z/(n) be a homomorphism, where SQ) = 1. Then 0 = (0) = f(ml) = 
ml = m & Z/(n) = nim. For the converse, define /(a + (m)) = a + (n). fis well defined 
because nim. 


. Let x,yE r(S), rE R. Then S(x — y) = Sx — Sy = 0, S(xr) = (Sx)r = 0. Thus, r(S) is 


a right ideal. Similarly, /(S) is a left ideal. 


. Leto: F —> Rbeahomomorphism. Because Ker cis an ideal, Ker o = (0) or F. Hence, oa 


is 1-1 or O, 


. Let x,y © A, and let B be any nonzero right ideal. Then r(x) N r(y) ON B # O gives that 


there exists 0 ¥# be B such that xb = 0 = yb; that is, (x — y)b = 0. Thus, r(x — y) VB #0, so 
x — yE A. Next, let a € A. IfaB = 0, then xaB = 0. This implies r(xa) N B * 0. There- 
fore, xa € A. But if aB #0, then r(x)M aB * 0 implies that there exists 0% bE B 
such that xab = 0; that is, r(xa)Q B*0. Hence, in any case for all xE A, aE R, 
xa € A, We now show ax € A. Clearly, r(x) C r(ax). Because r(x) N B * 0, we obtain 
r(ax)N B * 0. Hence, ax € A, proving that A ts an ideal in R. 


. Suppose A ¢ Band A ¢ C. Then there exists a, € A, a, € Band a, € A, a, ¢ C. Now 


consider a, + a, which isin A and, hence, in B U C. Thus, a, + a, € Bora, + a, € C. 
Suppose a, + a, € B. Note that a, € A, a, € C implies a, € B. Thus, a, + a,€ B 
implies a, € B, a contradiction. Thus, a, + a, ¢ B. Similarly, a, + a, ¢ C. But this is 
impossible. Therefore, our assumption that 4 ¢ Band A ¢ C is false. 


. Let (x,0),(y,0) € RP and (a,b) € R. Then (x,0) — (y,0) = (x — y,0) € RP, (x,0Xa,5) = 


(xa,0) € R®, and (a,bXx,0) = (ax,0) € R?. Hence, Rfis an ideal in R. Similarly, Rfis an 
ideal. Clearly, the mapping x —»(x,0) is an isomorphism (as rings) of R,; onto RP. 


. The mapping a: (4,,) — (a,,) + A, Of (R/A), into R,/A, is clearly well defined. For 


= (a,,) + A, = (b,j) + Ay. 


Also, ¢ is an onto ring homomorphism. Hence, a is an isomorphism. 


- (feXxy) = S(a(xy)) = Sle (vig) 


= f(g(x)) f(a») = (J2X-OUBXy). 


Hence, fg is a homomorphism (or isomorphism). 


. (a) O*% x + (p*) © Z/(p?) is invertible 


ee 1 — xy  (p*) for some 0 # y + (p?) € Z/(p) 
oe | — xyep’Z 
oo (x,p) = lop yx. 
Clearly, (p)/(p), the set of noninvertible elements, forms an ideal. 
(b) Note that a) + a,x + a,x? + --> € F[[x]] is invertible iff ag * 0. The set (x) of 
noninvertible elements is clearly an ideal (see Problem 7, Section 1, Chapter 10). 
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Chapter 10, Section 3 


1. 


3. 


ip 


Let aE A. Thena=a+0E€A+A4; that is ACA+A. Lettat+bdEA+A4,0,bE A. 
Then a + bE A; that is, A + ACA. Hence, A+ AA. 
The mapping 
a 
x+ (A, (x + Ay,..X + A,) 
tI 


is an isomorphism of R/12.,A, onto R/A, X -:: X R/A, [see Example 3.4(aXii)). So if 
(x, + Ay Xe + A,) E R/A, Meee K R/A,, then there exists x + 1"p.,4, E€ RIN, A, 
such that (x + A,,...,.X + A,) = (x, + A;,....X,_ + A,) that is, x — x,E A, 1 Sisn, 


« (a) f,((a,,43) + (6,,62)) = f\(a, + 5, ,a, + 5) 


=a, + b, =f,(a,,@2) + f,(d, by). 
Si((@s 4X5, .b2)) = f,(a,b, a2) = a,b, of S\(2, ,@3)f,(d, 62). 


Thus, f, is a homomorphism. Clearly, /, is onto. 

(b) Let A be a right ideal of the ring R = R, © R,. Leta = (a,,a,) € A. Then/,(a) = a, 
and f(a) = a,. Soa E f(A) © f(A), that is, A C f,(A) © (A). Next, let x, € f,(A). 
Then 3x,€R such that (x,,x,)€A. This implies (x,,x,X1,0)€ A; that is, 
(x, ,0) € A. Similarly, if y, € f(A), then (0, y,) € A. So (x,,y,) = (x,,0) + (0,y,) © 
A. This proves f,(A) © f(A) C A. Hence, A = f(A) © fA). 

(c) Follows from (b). 

Define a mapping 0: R X S—* R/A X S/B by o(x,y) = (x + A, y+ B). Then o is an 
onto ring homomorphism and 


Ker o = ({x, y)]xE A, yE B) AX B. 
Hence, by the fundamental theorem of homomorphisms 


Chapter 10, Section 4 


By Corollary 1.3 in Chapter 10, F,, has no nontrivial ideals. Hence, any ideal in F,, is (0) 
or F,,. So if AB C (0), where A and B are ideals in F,,, then we must have A = (0) or 
B = (0). (Both A and B cannot be equal to F,,, since AB C (0).) 


x8 4m (x2 + 2x + 2x2 — 2x + 2). Thus, (x2 + 2x + 2x? — 2x + 2) E (x4 + 4). 


But x2 + 2x +2 ¢ (x4 +4) and x?—2x+2 ¢(x‘+ 4). Hence, by Theorem 4.6, 
(x4 + 4) is not a prime ideal in Q{x]. 


. (a) => (b) Let aROC P. Let 


A™=aR + Ra + RaR + Za, B= bR+ Rb+ RbR+ Zb 
be the two-sided ideals generated by a and 5, respectively. Then 
AB = (aR + Ra + RaR + ZaXbR + Rb + ROR + Zb) ¢ P, 


because aRb © P. Hence, A ¢ Por BCP, that is, aE Por bE P. 
(b) = (c) Let AB ¢ P, where A and B are right ideals of R. Let A, = RA +A,A,@ 
RB + B, where 


ra={ r ne Rael 
es 4, \F, ‘ 
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RB has a similar meaning. Then A, and A, are ideals in R, and 
A, A, = (RA + ARB + B) = RARB + RAB + ARB + ABCP, 


because AB ¢ P. By (b) A; G Por A, C P. Hence, A ¢ Por BC P. 
(c) => (d) Let ABC P, where A and B are left ideals in R. Let A, = A + AR, A, = 
B + BR. As before, A, A, © P, and A, and A, are ideals (hence right ideals). Thus, by (c) 
A, © Por A,¢ P. This yields A ¢ Por BC P. 
(d) = (a) Trivial. 
7. Let xE Ne, P,. Suppose x is not nilpotent. Consider the multiplicative semigroup 
S = (x,x?,x3,...). Then 0 ¢ S. Let 


€ = (/\/ is an ideal in R with 1N S =O). 

€ is a po set. By Zorn’s lemma @ has a maximal member, say P. We claim P is prime. 
Let A and B be ideals in R such that ABC P. If possible, let A ¢ P and B¢ P. By 
maximality of P, x" € A + Pand x* € B + P for some positive integers m and k. Then 


(A + PXB + P) = AB + AP + PB + P? Cc Pimplies x"** € P, acontradiction. Hence, 
Pis prime. But then x € P, again a contradiction. This proves x must be a nil element. 


Chapter 10, Section 5 
1. Let A™ = 0, B" = 0. Let k = max(m,n). Then we have 
(A + BY = SABA ++ ABS, 


where e, and f, are nonnegative integers, = ,(e, + f,) = 2k. Thus, each term is a product 
of at least k factors all belonging to A or B. Because A* = 0 = B*, it follows from this 
equation that (A + B)?* = 0 and that A + B is a nilpotent ideal. 

3. Let a=(a,,) € F, be an element of a nil right ideal, say A Now write a = La,,e,,, where 
e;, are matrix units. Then (£a,,e,,)(e,,)EA; that ts, 2°, a,e,,EA. writing in matrix 
notation, we get 


a, O 0 
Qo. 0 0 

EA. 
an, O 0 


Then for any positive integer m, we have 


Qik 0 is 0 ” ay 0 one 0 
Q2. 0 + 0 a,,aTy ' 0 eee oO 


Qn 0 : 0 Any | 0. s+ 0 


Because A is a nil right ideal, at, ~0 = a,,a7,'= --- =a,,a%' for some positive 
integer m. This implies a,, = 0. Because k is arbitrary, we get that the first row of the 
matrix a is zero. Similarly, by multiplying La,,e,, by e,, on the right we obtain that the 
second row of the matrix a is zero. Proceeding like this we obtain a = 0. 
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Chapter 10, Section 6 


The proof is exactly similar to that of Theorem 6.1. 


Chapter 11, Section 1 


. Let (a,b) = dand (ca,cb) = e. Then dja = cd|ca. Similarly, cd|cb. Thus, cd|(ca,cb) => 
cdx = e for some x € R. Now ca = ey for some y € R. Thus ca = ey = cdxy. Hence, 
a= dxy, that is, dxja. Similarly, dx|b, so dx|d. This implies x is a unit; hence, (ca,cb) = 
e = cdx = c(a,b). 

. blac => bd = ac for some dE R. Now c = c(a,b) = (ca.ch) = (bd, bc) = b(c,d) = dic. 

. See Example 1.2(d), where it is shown that 2 + 7. — $ is irreducible. Also, (2 + f= $)|3-3, 
but (2 + V—5) 3. Hence, 2 + /—S is not prime. 

. Indeed, both 1+ ./—3 and 1 — /—3 are irreducible. For if 1+ ./—3 =(a+ b /—3) 
(c + d,/—3), then 4 = (a? + 3b?)(c? + 3d?). This impliesa = + 1,6 =O(orc = + 1,d =0). 
Beniaes 10+ Hi (10+ Njs—)_ 91 + 78) 

8 +i (8 + (8 — ) 65 
both sides with 8+é, 10+ 111i =(1 + i)(8 + i) +(3 + 2s). Next, divide 8+i by the 
+1 

+ 2i 

remainder is zero, showing 3 + 2i is the g.c.d. 


bf: d /- 
Considering (“* v-3 \(e 2 ) = 1, we obtain (a? + 3b?)(c? + 3d?) = 16. Now, 


~ Lar we 


wo 


=(1+)+ (; + si) Then by multiplying 


8 
remainder 3 + 2i and oblain - = 2 —i. This implies 8 + i = (2 — i)(3 + 21) and so the 


2 2 
neither a? + 3b? nor c? + 3d? can be equal to 1,2, or 16. Thus a? + 3b? = 4. This yields 
+1 —3 
(a, b) =(+ 2,0), or (+1, +1). This gives that units of Q[./ —3] are +1, ae 


Chapter 11, Section 3 
1. (a) Write b = bg +r, d{(r) < G(b). Suppose r #0. Then b(1 — g) =r implies bir; so 
¢(b) = f(r), a contradiction. Thus, r = 0, so ¢(0) < ¢(b). 
(b) We have ajb and bja; so d(a) = ¢(b) and ¢(b) = da), that is, (a) = f(b). 
(c) Write a= bq +r, f(r) < G(b). Now alb = ax = b. Thus, 
a=bg+r=axqg+r=a(l —xq)"=r=alr 
=> d(a) = f(r) < d(b) = fa) 


unless r = 0. Then dja. Therefore, a and 6 are associates. 
3. Let 0 * a € R be noninvertible. Then the ideal xR[x] + aR[x] is not a principal ideal. 
For let 


xR{x) + aR{x) = f(x)R[x]. (1) 
Then a € f(x)R[x] implies a = /(x)g(x). This gives f(x) = 8, g(x) = y, where By € R. 
Also, 


XE f(x)R[x] = x = f{(x)h(x) = Bh(x) = B(bx) => 1 = fb, DER. 
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Thus, £ is invertible, so f(x) R[x] = BR[x] = R[x]. Then from (1), 
1 = xu(x) + av(x), u(x), ox) € R[x]. (2) 
(2) gives u(x) = 0, and v(x) = c, cE R. But then | = ac, so a is invertible, a contradic- 
tion. Hence, R[x] is not a PID. 
. Apply Theorem 3.2 of Chapter 11 and Theorem 4.7 of Chapter 10. 
7. Indeed every ideal contains a product of prime ideals. Otherwise, consider a family F 


of all ideals which are not products of prime ideals. We claim F has a maximal member. 
Otherwise, we can produce an infinite properly ascending chain 


ee) 


A,CA, CAC: 


of ideals A,eF. But if we consider B=WUA,, B is an ideal, say bR. Let be A,. Then 
A, = A,,, =-::, @ contradiction. Thus F = @. In particular, 0 = P,P,---P,, where P, 
are prime ideals. The uniqueness follows easily. 
9. Since bja implies $(b) < $(a), it follows that if ac = 1 then ¢(a) = (1). Conversely, write 
l =aqg+r. If r #0, then $(r) < d(a) = (1) < d(r), a contradiction. Thus r = 0. 
11. JP = aR. Write a =p, ---p,, where p, are prime elements. Since /P is a prime ideal, 
for some p.e./P. It is easy to check JP = p,R, and P= prR,e>0. 


Chapter 11, Section 4 


1. Let R= F[x]). Then R is a commutative integral domain that is not a field. Then 
R{y] = F(x, y] is not a PID (see Problem 3, Section 3). 

3. First suppose f(x) is a zero divisor in R[x]. Let K = (g(x) € R[x]lg(x)/(x) = 0). We 
want to show KN R * (0). If f(x) € R, then it is clear that KN R * 0. So assume the 
degree of /(x) > 0. If possible, let KN R = (0). This implies that if 0 * cE R, then 
cf(x) * 0. Let g(x) € K be of minimum degree, say m. Write 


(x) = bg t Oyxt ++ + 5x. 
Then 5,, * 0. Also, by assumption, m > 0. Let 
J(X) = an + a,xt+ °°: a,x", a,*0. 


Because b,, f(x) * 0, a,9(x) * 0 for some i = 0,1,...,. Choose p to be the largest positive 
integer such that a,g(x) * 0; that ts, a,,,9(x) = 0 = a,,.9(x) = °*> = a,g(x). Then 
O = f(x)g(x) ™ (ag t+ ayxt + +a,x?+a,,,x9t! + --> +4,x")e(x) 

= (do + a,x+t+ -** + a,x?)g(x) 
implies a,b,, = 0. Thus, the degree of a, g(x) < m. Moreover, (a,g(x))/(x) = 0. Hence, 


a,g(x) € K, degree a, g(x) < m, a contradiction. Thus, KN R * 0, so there exists b € 
R, bf(x) = 0. The converse is clear. 


Chapter 12, Section 1 


1. Define 4: Rs — R’ by h(a/s) = g(aXg(s))—'. Then hf(a) = h(a/1) = g(a), so hf= g. 
3. Let A be an ideal in Rs. Let A° = (a € Rla/s € A for somes € S$}. Then A‘ is an ideal in 
R. Next, let A and B be ideals in R, such that A ¢ B. We show that A¢ ¢ B*. This will 
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prove an infinite properly ascending chain of ideals in R, gives rise to an infinite 
properly ascending chain of ideals in R. This proves the required result. Let A ¢ B. 
Clearly, A° C B®. If possible, let A‘ = B*. Choose b/s € B, b/s € A. Then bE Be = A. 
Thus, there exists s, € S such that b/s, € A. This implies (b/s,) « (s,/s) € A; that is, 
b/s € A, a contradiction. Hence, A ¢ B = A‘ g B*, as desired. 

5. Note that R(x) # R(x, y). 


Chapter 12, Section 2 


1. We first claim that R cannot possess an infinite properly ascending chain of right ideals. 
For. let a,Rcoa,Rca,Rc-:- be a properly ascending chain of nght ideals Let 
A =Us_,a,R. Then A isa right ideal, so A = cR forsomec € R.NowcE€ Ace a,R 
for some i. Then cR C a,RC A= cR. Thus, A = @,R. This yields a,R = a,,,R™...,a 
contradiction. Further, if possible, let /, J be nonzero right ideals such that 7/N J = (0). 
Let 0 * ae J. Then for any positive integer m, =f. , a*/ is a direct sum of right ideals. 
This yields an infinite properly ascending chain of right ideals, 


alcal@a@lcal@ai@aic-::, 


a contradiction. Hence 1M J * (0), which shows that R is a mght Ore domain. 
3. Let R be an integral domain satisfying a standard identity 


Sy(Xp sees Xn) = SY EX, Xi Xs,» 


where the summation runs over every permutation (i,,...,/,) of (1,...,7), and the sign is 
positive or negative according to whether (i,,...,/,) is an even or an odd permutation. 

We first show that any direct sum © 4., A, of right ideals A, cannot contain more 
than n — | terms. If possible, let 4,© --: © A, be a direct sum of n right ideals. Let 
0*a,EA,, 1s isn. Then S,(a,,...,.a,) = 0. Rearranging the terms in S,(a,,....4,), 
we have 


a,S,- 1(23,...,4_) baal a,5, ~(@3,..-;8_,84) + a;S, —(Qq5..48n,Q, 4) — +s  (Q), 


This implies that @,5S,_ ,(@2,....€,) = 0, @,5,-;(a3,....€2,)™=0, and so on, because 
A, ® -*: OA, isa direct sum. But then S,_ ,(@2,...,.a,) = 0 forall0 #a,EA,,2sis 
n. Continuing likewise, we obtain a, = 0, a contradiction. 

Now, we show that if A,B are right ideals, then A N B ¥ (0). If possible, let AN B= 
0. LetO * c € B. Then £7. ,c'A is a direct sum of n right ideals, a contradiction to what 
we proved before. Hence, AM B ¥ (0), proving that R ts a right Ore domain. Similarly, 
we can show that R ts a left Ore domain. 


Chapter 14, Section 1 


1. (R[x],+) isan additive abelian group. Let r € R, f(x) = ag + a,x + °°: +a,x"E R[x]. 
Define rf(x) = ray + (ra,)x + -:: +(ra,)x". Then, clearly, 
(r, + 12) f(x) = 7, f(x) + AS (>), 
A(x) + S40) = f(x) + 00, 
(rir) f(x) = (7, S(x)), 
f(x) = f(x) iflER. 


Hence, R[x] is a left R-module. 
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3. Misa Z-module by defining 


a times 


Ss 
m+o-s +m, a>0O, 
—a times 


| tie 
—m+--+—-m, a<0, 
0, a=0, 


where a € Z, mE M. Suppose M is also a Z-module by a mapping f: Z X M — M. 
Then by axiom (iv) for a module, f(1,m) = m. Also, by axiom (ii), ifa € Z, a > 0, then 


a times a times 


a, Ge 
S(a,m) = f(L + +++ +1, m)=f(i,m) + +: +f(l,m) 
= af(1,m) = am. 


Next, using axiom (ii) again, we have 

f(—1,m) = f(— 1 + 0,m) = f(— 1,m) + f(0,m). 
Hence, f(0,m) = 0. Further, 

0 = f(0,m) = f(1 + (— 1),m) = f(l,m) + f(— 1m), 


so f(— 1,m) = —f(U1,m) = —m. Then f(—a,m) = —am, a> 0. Hence, for all a € Z, 
f(a,m) = am. This proves that there is only one way of making M a Z-module. 


Chapter 14, Section 2 


1. (a) W= ((0,A,,A5)|A2,Ay € R) is clearly an additive subgroup of (R',+), and if a € R, 
(0,4,,A) & W, then a(0,A,,A;) = (0,aA;,aA,) € W. Hence, Wis a subspace of R?. 
(b) W= ((A,,A,,A5)]A,,4, € R) is clearly a subspace. 
(c) W= ((A,,Ag,A5)IA; 42,45 € R, A,A, = 0). Now (1,0,1), (0,1,1) € W. But (1,0,1) — 
(0,1,1) = (1,— 1,0) & W, because 1,— 1 #0. Thus, W is not a subspace. 
(d) W= ((2A, — 1,49,45)Ag.43 ER). Now (1,1,I)EW and (3,2,1)€W. But 
(3,2,1) — (1,1,1) = (2,1,0) € W. So Wis not a subspace. 
(ec) W=((A,;,A2,43) € RA, + 4,20). Now (1,2,3)EW, but —(1,2,3)= 
(—1,—2,—3) & W. Thus, W is not a subspace. 
3. (a) W=(f: R—-RUf(1)=0). Let feew, aER. (f— kX!) =f) —a(1)=9, 
(af K1) = a( f(1)) = 0. Thus, f— g, af Ee W. Hence, W is a subspace. 
(b) W=(f: R > Rif(0) = 1). Let fig © W. Then (f— gX0) = f(0)-— g(0) = 1—-1= 
0. Hence, f— g ¢ W. Thus, W is not a subspace. 
(c) W=(f:R — RU(3) = 2f(2)). Letfhge W,aeER. Then 


(f— gX3) = f(3) — g(3) = 2( f(2) — g(2)) = 20 f— gX2), 
and 
(af 3) = a( f(3)) = a(2f(2)) = 2(af K2). 


Thus, f— g, af€ W. Hence, Wis a subspace. 
(d) W=(f: R — Rif(3) 2 0). Now (—f 3) = —f(3) s 0. Hence, —/¢ W. Thus, Wis 
not a subspace. 
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5. 


13. 


(e-) W=(f: R-Rilim,.., /(0 exists). Clearly, if fee W, then f— ge W, and if 
aeéR, then afe W. Hence, W is a subspace. 

Let A = (x € RixM = 0). Suppose x,y € A andre R. Then (x— y)M=xM—yM= 

0, (xr)M C xM = 0, and (rx)M = r(xM) = 0. Hence, x — y,xr,rx € A. This proves that 

A is an ideal in R. 


. Let (a,b,c) € V. Then 


(a,b,c) = (a = b)x, + (b — c)X2 + Cy E Rx, + Rx, + Rx;. 


Thus, V = Rx, + Rx, + Rx,. Now if 0 = ax, + Bx, + yx;,0,8,yE R, then at+ B+ 
y= 0,8+y=0,y = 0. Thus,a =0 = £= y. Hence, Rx, + Rx, + Rx; isa direct sum. 


. The ring R has eight elements: 


anon(§ 9 a-(6 9} #(9 3} (6 9) 
an(id ar(6 h o-( a6 9) 


Here a; ,@2,24,25,Q,,2, are idempotents. Because a left ideal A is a direct summand if 
and only if A = Re for some idempotent e € A, it follows that 


ra={(° s)} ea-{(2 6 9} 
raw {(> o)(0 ¢)( obo i} 
ra-{(3 $)(6 a) 


and Ra, = R are direct summands of R as a left R-module (note that the left ideal 
generated by a, does not yield another direct summand, because Ra, = Ra,). 


. Let a,b E R. Then 


ae + bf = ae + b( f— fe) + bfe= (a+ bf )e+ b( f—fe). 


Thus, Re + R/C Re + R( f— fe). Similarly, Re + R( f— fe) C Re + Rf. Hence, Re + 
R( f— fe) = Re + Rf. Next, let aE Ren R( f— fe). Then a = xe = y( f— fe). This im- 
plies xe? = y( f— fe)e = 0; that is, xe = 0. Hence, ReN R( f— fe) = (0), which proves 
that Re @ R( f— fe) = Re+ Rf 

Let J = Rx, + +> + Re, and/ = J*. Then / = Ix, + +++ + Ix,. So there exist a,,€ / 


such that 

Xj Ay Xy HF oo FAiyAX,y, Isisn; 
$o 

(1 = Ay4)X, a + 0,,X, = 0, 


QniX; +--+» +(1—a,,)x, #0. 


Because R is a commutative ring, by Cramer's rule for solving a system of linear 
equations, we have 


I—-a, - Qin 
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Hence, (1 — z)x, = 0 for some z€ /. This yields that (1 — z)/ = 0, so / = z/, where 
z= z? and this proves that / is a direct summand. 

Let R = R® be the ring of functions from R to R. Let / = (fE RI f(a) = 0 for all but 
countably many a € R). Then /is an ideal of R that is not finitely generated; /? = /, but / 
is not a direct summand of R. 


Chapter 14, Section 3 


I. (a) Let x,ye Ker f, aE R. Then f(x — y) = f(x) — f(y) = 0, flax) = af(x) = 0. So 
x — y,ax & Ker f, and, hence, Ker fis an R-submodule of 7. 
(b) Let f(x), f(y)E lm f, aE R. Then f(x) — f(y) =/(x — y)E lm f, and af(x) = 
f(ax) € Im f, Hence, Im fis an R-submodule of M. 
. Letqg€ Q. Define a mapping g*: Q — Q defined by g*(x) = gx. Then 
q*(x + y) = Q(x + y) = gx + gy ™ q*(x) + g*(y) 
where x, y € Q. Also, for a € Z, 
g*(ax) = g(ax) = a(gx) = aq*(x). 
Thus, g® is a Z-homomorphism of Q to Q. Next, let us define a mapping /: 
Q — Hom,(Q,Q) by /(q) = 9°. Let q,4,,x € Q. Then 
(9192)°(X) = (91 92)% = G,(G2X) = 94(G3(X)) = GP (GE(X)) = (GTP GFX). 
Also, 
(9, + 92)°(X) = (Gy + G2)x = GX + G2x = QT(X) + QF(X). 
Hence, / isa ring homomorphism. / is also injective, forg*® = Oimplies g*(x) = 0 for all 
x € Q. In particular, g°(1) = 0; that is, g = 0. Finally, we show that / is surjective. Let 
o € Hom, (Q,Q), and let o(1) = g. Let a/b € Q. Then 
q=o(1)= os . i) “= bo( +) implies ; = o(t). 


Further, 


o(5) eo(5) ne (5). 
Hence, o = q* = f(q); that is, fis surjective. This proves Q = Hom,(Q,Q). 

5. Define a mapping f/: F"— V by /(a,,...,.€,) = a,X, + «°° +a,x,. fis clearly an 
F-homomorphism. / ts also injective, for a,x, + :°: + a,x, = 0 implies, by hypoth- 
esis, that (a,,...,@,) = 0. Also, since V is generated by x,,...,x,, / is surjective. Thus, 
Fam V. 

7. Letd: R — R/I be an isomorphism. Let x,u € R be such that d(x) = 1 + J and d(1) = 
u+ J. Then for all a € J, d{ax) = a(x) = a + 1 = 0. Because ¢ is injective, ax = 0. 
Thus, /x = 0. Also, 1 + J = G(x) = xd(1) = xu + I. Then 1 — xu € J. Because [x = 0, 
(1 — xu)x = 0; thatis, x = xux. Let {= xu. Then fis an idempotent and | — fe /. Also, 
[f=0,sol=R1-f)=Ree=1—-S. 


we 


Chapter 14, Section 4 


3. Z/(D, Pp) = ZK py) X Z/(p,). lf A is a Z-submodule of Z/(p,), then A is also a Z/(p,)- 
submodule of Z/(p,), because A(p,) = (0). But then A = (0) or A = Z/(p,), because 
Z/(p,) a8 Z/( p,)-module has no nontrivial submodules. Thus, Z/(p,) and, similarty, 
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Z/(p,) are simple Z-modules, which proves that Z/(p, p,) is a completely reducible 


Z-module. 
5. R/M = (6 °)@(° 0), and each (F 0), (0 9) is a simple R-(as well as) R/M-module. 


Chapter 14, Section 5 


1. If xe is any element in a basis (if it exists) of Re, then (1 — e)xe = x(1 — eje = 0 implies 
1 — e = 0, a contradiction. 
3. Follows from the definition of basis of a free module. 
5. zQ free implies Q = Z © K for some Z-module K ¥ (0). 
Q = Hom,(Q,Q) = Hom,(Z @ K, Z@ kK), 
but the latter (which is isomorphic to Q) has nontrivial idempotents, a contradiction. 
. Rx xR under natural mapping, if x #0. 
9. Take in Problem 4, M=A, N=B=F, and f =identity map. Then there exists a 
homomorphism h:B-- A such that gh = /. It can be shown now A = Kerg@Imh. 


~3 


Chapter 14, Section 6 


1. (a) D(1) = 0 + Ox + Ox? + Ox?, 
D(x) = 1 + Ox + Ox? + 0x’, 
D(x?) = 0 + 2x + Ox? + 0x3, 
D(x?) = 0 + Ox + 3x? + Ox?. 


The matrix of D with respect to &, is 


(b) D1) = 0, Dix +1) = 1, Dix + 1)? = Ax + 1), D(x + 1)? = Hx + 1). Thus, the 
matrix of D with respect to , is 


ooo & 
ooo- 
oon }& 


(c) 1 = 1+ Oe + 1) + Ox + 1)? + Ox + 1), 
Xe —1 t+ i(xt 1) + Ort 1)2?+Xxr4+ 1p, 
x2 | — 2x + 1) + M(x t+ 1)? + Xx + 1), 
xd me — f+ Hx + 1) — xt 1)? + (xt 1). 


Thus, the matrix of transformation of basis @, to basis 9, is 
es | i =] 
0 1! -2 3 
0 0 1 -3 
0 0 O 1 
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2 0 
3. (a) k | 


cosa sina 
ecard 
—sina cosa 
2+ sin? a + 2sinacosa cosasina -—-2sin?« 
(c) 2 e 2 . * 
2cos* a + sin acosa 2 + cos‘ a— 2sinacosa 


Chapter 14, Section 7 
1. (a) The matrix of @ with respect to standard bases is 


2-1! 3 ! 
A={[!1 -8 6 8 }. 
! 2 O -2 


If R,,R,,R; denote the first, second, third rows, respectively, then by considering 
aR, + BR, + yR, = 0, we obtain that —2R, + R, + 3R, =0.So R,, R2, and Ry 
are linearly dependent. But aR, + BR, = 0 implies a = 0 = f. Thus, row rank 
A = 2; hence, rank @ = 2. 

(b) Proceeding as in (a), rank @ = 3. 

3. (a) Let V={x{Bx =0}, W={x|ABx =0}. Then dimV=—n— rank B=q, say; and 
dim W =n — rank (AB) =r, say. Clearly Vc W. Choose a basis x,,. .,x, of V and 
extend (if necessary) to a basis x,,...,%4, Xgea.---.X, Of W. Then Bx,,,,..., Bx, is 
a basis of U = {Bx|ABx =0, xeF*}. Thus dim U =r — q = rank(B) — rank(AB). By 
the rank nullity theorem, dim U < n— rank(A) This proves the desired inequality. 

(b) Follows by problems 2(a) and 3(a) on choosing B= 1 — A. 
5. Now ATAx = 0=>x"ATAx =0=0(Ax)"(Ax) = 0 Ax = 0. Thus nullity (A7 A) = nullity(A) 
and so by the rank nullity theorem, rank(A’ A) = rank A. 


Chapter 15, Section 1 


1. Let f(x) = x> + 3x + 2, and 0,1,2,3,4,5,6 be the elements of Z/(7). By Proposition 1.3, 
S(x) is reducible over Z/(7) iff f(x) has a root in Z/(7). It is easily checked that none of the 
elements 0,1,2,3,4,5,6 are roots of /(x). So f(x) is irreducible over Z/(7). 
3. Apply Theorem 1.7. 
$. (a) The only irreducible polynomial of degree 2 is x? + x + I. 
(b) Irreducible polynomials of degree 3 are of the form f(x) = x> + bx? +cx+ 1, 
where b,c € Z/(2). So(b,c) can be (1,0), (1,1), (0,0), (0,1). For f(x) to have no root in 
Z/(2), we must have (6,c) = (1,0) and (0,1). So the irreducible polynomials of degree 
3 are x? + x? + 1 and x°+x+ 1. 
(c) Polynomials of degree 4 with nonzero constant term are of the form 


S(x) = x4 + bx? + cx? + dx + I. 
If f(x) is reducible over Z/(2), then f(x) must be one and only one of the following: 


S(X) = (x2 +X + U2 + x + 1) = x4 + x27 4:1; (b,c,d) = (0,1,0); 
S(X) = (x3 + x2 + Ix + 1) xt + x27 4x41; (b,c,2) = (0,1,1); 
S(X) = (8 ++ UY + 1D) x8 $+ x3 + x24; (b,c,d) = (1,10); 
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S(x) = (x27 ++ Ix t 1)? xt t+ x8 + x41; (6,c,d) = (1,0,1); 
S(x) = (x + 1S = x4 + 1. 


So the irreducible biquadratics are those corresponding to (b,c,d) = (0,0,1),(1,1,1), 
and (1,0,0); so they are x4 + x + 1, x4+ x9 +x27+x+ 1, and x4+ x°+ 1. 


Chapter 15, Section 2 


1. 


Let 0,1,2 be the elements of Z/(3). It is easily seen that none of these are roots of 
x?—x-— 1. Hence, x? — x — 1 is irreducible over Z/(3). 

Consider the set K = (a + buja,b € Z/(3)), where u2 — u — 1 = 0. Then Kis a field 
containing Z/(3) that contains one root and, hence, both the roots of f(x). 


. The required extension is the field 


K = (a+ bu + cu? + dua,b,c,d E Q), 


where u4 — 2 = 0. We can write u = V2. 


. We invoke the Kronecker theorem (Corollary 2.4) to get successive fields K, C 


K,c - CK,, such that K, contains a root of f;(x), K; contains a root of f(x), and so 
on. 


Chapter 15, Section 3 


FC KC E. Now ais algebraic over K = a is a root of an irreducible polynomial, say 
f(x) = ag ta, + °° +a,,x", a,€K,i=0,...,m. 


Consider the field L = F(do,...,€,,). Because K is algebraic over F, each a, is algebraic 
over F; so L is a finite extension (and so algebraic) of F (Theorem 3.6). Now a is 
algebraic over L = L(a)isa finite extension of L = [L(a): L] < ©, so, by Theorem 2.1, 
[L(a): F) = (L(a): L){L: F) < 


This implies each element of L(q) is algebraic over F, and, in particular, a is algebraic 
over F. 


. (a) Let u= J2 +5. Then (u — 5)? = 2; that is, u2— 10u + 23 = 0; so the minimal 


polynomial of /2 + 5 over Q is x? — 10x + 23. 
(b) u=3¥2+ 5 =>(u— 5)? = 18; that is, u2 — 10u + 7 = 0; so the minimal! polyno- 


mial is x? — 10x + 7. 

(c) w= V—1 + V2 => (u? + 1) = V2 = 4 + 22 — 1 = 0; so the minimal polynomial 
is x4+ 2x?— 1. 

(d) u= V2 —3V3 = u2 = 2 +27 — 6 V6 => (u? — 29)? = 36-6 = u* — S8u? + 625 = 0. 
So the minimal polynomial is x* — 58x? + 625. 


. FC E,a,b€ E. Now [F(a,b): F) = [F(a,b): F(a)} > (F(a): FJ, which shows that m di- 


vides [F(a,b): F]. Similarly, n divides [F(a,b): F). Because (m,n) = 1, mn{[F(a,5): F). 
But [F(a.b):F] =[F(a,b):F(a))-[F(a):F] shows that [F(a.b):F]<mn. Hence, 
[F(a,b): F} = mn. 


. Let (F(a):F) = nand (F(a): F] = m. Obviously m sn. Let f(x) = ag ta,xt+ oo + 


x™ be the minima! polynomial of a? over F. Then ay) + a,a? + °°: +a,,a°"=0= a 
satisfies the polynomial a) + a,x? + °*: +.4,,x2" € F[x]. So n|2m. Because n is odd, 
n|jm. But ms n. Hence, m = n. 
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9. We have a tower of fields 
F(x) 


L 


F 
Let y = p(x)/q(x) € L, p(x), 0 * gx) € F[x], y ¢ F. Then p(x) — yq(x) = 0. This shows 
that x is algebraic over L. Thus, F(x) is algebraic over L. 


2n |? 2 2 2 Z 
tt. | o—cos | = —sin?—. So «w*—2wcos-- +cos? -- = —1 +cos? —, yielding 
n n n n n 


2 
w? — 2w cos = + | =0. Thus w satisfies an irreducible polynomial of degree 2 over Q(u). 
n 


Chapter 16, Section 1 


1. (a) Let 1 *w be a cube root of |. Then the splitting field of x> — 1 over Q is K. 
K = Q(w), where w = (— 1 + ¥3i)/2; that is, K = Q(v3/). 

(b) x4 + 1 is irreducible over Q, because (x + 1)* + 1 = x4 + 4x3 + 6x2 + 4x + 2 is 
irreducible over Q [take p = 2 in the Eisenstein criterion]. Also, w = cos(x/4) + i 
sin(2/4) = (1/ ¥2\1 + i) is a root of x* + 1, and the other roots are 

w= cos + isin = 1 + i), 


v2 


w= (I - i), 
wt = (1 — i), 


so the splitting field is Q(v2,/). 
(c) The splitting field is O(V3i). 
(d) The splitting field is Q(./2, 3/3,«), where w=(— 1+ /3i)/2. In (a), [K:Q] = 2 in 
(b) {K:Q] = 4; in (c) {K: Q] = 2; in (d) (K:Q] = 12. 

3. f(x) = x) + ax + b must be irreducible over Q, otherwise the degree of the splitting 
field of f(x) over Q is 1 or 2. So let f(x) be irreducible over Q, and let f(x) = 
(x — a,x — a,x — a). Then the splitting field of f(x) over Q is K = Q(a, ,@2,@3). 
From the equation 
Q,+a,+a,;70, aya, + a,a; + a30, = 4a, a,a,a, = —b, (1) 
we calculate 
D = ((a, — a, Xa, — aX — @,)]? = —4a> — 276’, 


which is called the discriminant of x3 + ax + b. If VD € Q, then the splitting field of f(x) 
is K = Q(a), where a is any one of a,,a@,,@,. For suppose a = a,. Then 


(a, — aKa, — aKa, — a3) EQ = (a, — a,)[a,a, — aa, + a) + a7] EQ; 
that is, a, — a, € Q(a), from (1). Also by (1), a; + a, € Q(a), 80 a; ,a@,,a, € Q(a); 
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that is, Q(a) is the splitting field, and [Q(a): Q] = 8 of the irreducible polynomial 
f(x) = 3. If VD ¢ Q, then the splitting field K is Q(VD,a) and [K:Q] = 6. Thus, D = 
— 4a3 — 27? must be a square in Q in order that [K:Q] = 3. 


. We use the following result: If u satisfies a polynomial of degree m over F, then 


[F(u): F] s m. Letf(x) € F[x] be of degree n over F. Let a; ,@3,...,@, be the roots of f(x) 
in £. Then 

[F(a,): F] sn, 

[F(a a2): F(a,)] s(n — 1), 

[F(Q, ,-050,) 2 F(Q; ,...5X,—1)] S 2. 

So 


[F(Oe) 06g 500 q) 2 FY] [F (01 5.0504) 2 F(Qy 5-1-1) [F( Oe, 5---5Qp—1)! F(Qy 5--1@n—2)] 
 [F(a,):F]<2:3---n=n!. 


. Leta € E bea root of p(x). Let 8 be another root of p(x) in some extension of F. Then 


since F[x]/( p(x)) = F(@), where 0 is any root of p(x), it follows that F(a) = F( 8). Wecan 
regard f(x) as a polynomial over F(a) as well as over F( 8). Then E is also a splitting field 
of f(x) regarded as a polynomial over F(a). Let K be a splitting field regarded as a 
polynomial over F( 8). Then by the uniqueness of splitting fields (up to isomorphism), 
we have E = K. Further, because (x) as a polynomial over F splits in E, it follows that 
splitting field of f(x) over F( 8) must be contained in E( 8); that is, K C E(B). However, K 
contains all the roots of f(x), including £. Thus, K D E( 8). Therefore, K = E( §). Fur- 
ther, 


[E: F) = (E: F(a))[F(a): F] = ([K: F(B)\F(B): F) = (K: F) = (E(B): F). 
Therefore, E = E(f), so BE E. 


9. f(x) is irreducible by Eisenstein’s criterion. Choose any two roots, say, « = fi + Aer 


B = /1 —.,/3. Since the minimal polynomial of « and is the same, Q(a) ~ Q(A). 


Chapter 16, Section 2 


1. 


(a) The minimal polynomial of V—2 over Q is x? + 2, its roots in C are V—2, — V—2, 
and both € Q(v—2). Hence, Q(v—2) is a normal extension of Q. 

(b) The minimal polynomial of 5 V7 is x? — 175, whose roots are 5 V7 and — 5 V7, and 
they lie in Q(5 ¥7). So Q(5 V7) is normal over Q. 

(c) By a similar argument, Q(V—1) is normal over Q. 

(d) If xis not algebraic over Q, then Q(x) is not an algebraic extension of Q. So Q(x) 1s 
not a normal extension of Q. 


. Eis a normal extension of F => E is a splitting field of a family of polynomials over 


F = E is a splitting field of a family of polynomials over K, because F C K, and any 
polynomial over F can also be considered as a polynomial over K. Hence, E is normal 
over K. 

Take K= Q(¥2) and E= Q(V2,). Then Qc Q(v2 Cc Q(¥2,w), where w = 
(—1 + V3i)/2. Clearly, Q(V2) is not normal over Q, but aye 2,@) is normal over Q(2). 


. Let Ebea finite extension ofa finite field F, and [E: F] = n. Then E is also a finite field, 


and E*, the multiplicative group of E, is cyclic and generated by u, say, such that 
u"-' = 1. Thus, u is a root of x" — x € F[x]. All the other roots of x" — x are 0 and 
powers of u, and so are in E. Hence, Eisasplitting field of x" — x € F[x], and, therefore, 
E is normal extension of F. 
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7, x? —x—1 is irreducible and has only one real root (consider graph of y= x>—x-— | 
or refer to Chapter 16, Section 1, Problems 3 and 4). Thus Q(a) is not a normal extension 
because its minimal polynomial x* — x — | does not split into linear factors over Q. 

9. The smallest normal extension of Q(2'/*, 3'/*) is the splitting field of (x* — 2)(x* — 3), 
namely, Q(2!/*, 3!/* i). 

11. Follows from the fact that any field E is normal over F iff each embedding o:E — F that 
keeps elements of F fixed sends E to E. 


Chapter 16, Section 3 
1. Let f(x) = ag + a,x + a,x?+ +++ and g(x) = by + b,x + Ob. x2 + +++. Then 
S (x) + g(x) = (do + bo) + (a, + O,)x + 2+ + (a, + b))x'+ o 


= 3 (a, + b,)x'. 
/=0 
By definition of the derivative, 


(f(x) + g(x)’ = Sia, + b)xi l= S ia,x!! + S ibjx! 
ims i=! 


= f"(x) + g’(x). 
3. Suppose f(x) € F[x] has a root a of multiplicity 1. Then 
f(x) = (x — a)"P(x), = Ha) ¥ 0. 
If d(x) € F, then f(a) = 0, k = 1,...,.2 — 1, and f(a) = n! # 0. Suppose P(x) is of 
degree = 1. Then 


SL OXX) = n(x — a)"—'" G(x) + (x — a)"h'(x) = (x — a@)""'G,(0), (ax) # 0 
Sx) = (x — a)"-2*(x), O(a) * 0, 


LO- YX) = (x — a)h,-100), $1 (a) #0, 
S%x) = hy (x) + (x — a)pi,_ (x). 


Thus, f~ (qa) = 0 and f(a) = ,_ (a) # 0. The converse is obtained by retracing 
the above steps. 


Chapter 16, Section 4 


1. The mapping @: x +» x° is an endomorphism of the ring F, since (x + y)? = x? + y? 
and (xy)? = x*y?. @ is also 1-1, since Ker ¢, an ideal of F, must be (0). Because |F| is 
finite, dis 1-1 and onto. That is, g is an automorphism of F. Thus, ifa € F, 36 € Fsuch 
that b? = q. The unique d is called the pth root of a and written Va. 

3. By Theorem 4.3, GF(2°), the Galois field with eight elements, is the splitting field of 
x® — x over GF(2). Also, the multiplicative group of GF(23) is cyclic, of order 7, and 
generated by any element 5 #1. So GF(2°)= {0,6,...,6’), where 1+5+--- + 0° =0, 
dF = 6. 


GF (13) = the set of integers modulo 13 =(0, 1,2,..., 12), 
which Is generated by 2. 
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The field F of four elements is the set (0,1,a@,1 + a), where 1 + a + a? = 0, An irreduc- 
ible polynomial of degree 2 over F is x? + ax + 1. An irreducible polynomial of degree 
3 over F is x? + x? + ax +a+ 1. An irreducible polynomial of degree 4 over F is 
x4 + ax? + ax? + (at 1)x 4+ 1. 


. Let|F|= p*. First assume that p ¥* 2. For any fixed a & F, let A = (a — x?|x € F) and 


B=(y*|y & F). Note that |A| = (p" — 1)/2 + | =(p"+ 1)/2. Also, {B| = (p* + 1)/2. 
This implies that 4 M B # ©. Hence, a can be written as a sum of squares. Next, we 
assume that p = 2. Then the mapping ¢(x) = x? of F into Fis bijective. So ifa € F, then 
there exists x € F such that a = x?. 


_ If f(x) =x? —x—1, then f'(x) #0. So the roots of f(x) are distinct. Further, if « is a 


root, then « + | is also a root. Thus the p roots may be written as a, a + 1,...,a+(p— 1). 
If c€Z,, then all roots lie in Z, and so 0 must be a root of f(x), which is not true. This 
shows Z,(«) (# Z,) is a splitting field of f(x) over Z, and [Z,(«):Z,] = p, proving f(x) 
is irreducible over Z,. 


Chapter 16, Section 5 


3. 


2. 


7. 


V3 — V2 = 1/(V3 + V2) eo + V2), so ¥3,¥2 € Q(v3 + V2); that is, Q(v3,v2) c 
Q(V¥3 + V2). Trivially, Q(V¥3 + V2) C Q(v3,v2). Hence, Q(Vv2,V3) = Q(Vv2 + V3). 

Put 6 = V2 + w. Then 6? = 2 + 2V¥2w — (w + 1) € Q(8). This implies w(2 V2 — 1) € 
Q(6). So (2 ¥2 — 1)3 € Q(A); that is, 8- 2V2—3+4-24+3-2V2—1 © Q(6); that 
is, ¥2€ Q(6). This together with c(2V2—1)€Q(6) gives w €Q(6). Hence, 
Q(v2 + w) = Q(v2,0). 

Let F be a field with p™ elements and E be an extension of F having p* elements. Then 
E = F(a) where ae E and so a" —a =0. This implies « is a separable element, and hence 
F(a) is a separable extension of F. 

x’ —x—1 or any of its factors cannot have repeated roots. 


Chapter 17, Section 1 


1. 


(a) S, = (1,0), 02: ¥2 + ¥2w?, w — w?. The basis of E over Q is 
(1 V2, 4,0, V2,w V4). Let 


a=x, + X_V2 + x3V4 + x,w + Xo V2 + x, V4 = Q(¥2,w), 
x, € Q, i= 1,2,...,6. 


Then o,(a) = @ gives 


xX, + x(¥2w?) + x(V2w? . ¥2w?) + xqo" + x0*V2e + x04 V2? ; ¥20?) 
= x, + xX, V2 + x, V4 + x, + x, V2 + x, V4; 


that is, 


—x, + V2(—x, — x2) + V4(x6 — x3) + O(-X, - 
- ve ; ° - i ae +w Va(— x, + x;)=0, 


which gives 

x4 =O0=X, X3™ Xe. 

So 

a=x, +x,V4 + x,wV4 + x,wv2. 
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Thus, 
a= x, + x, V4(— w?) + x V2 = Q(w V2). 


Conversely, if aE Q(w 12), o,(a) = a. Hence, Es = Q(w? V4). 
As in (a) we can show that if 


a=x, + x, V2 +x, V4 + x,w + x50 V2 + x0 V4 Es, 


(b 


ww 


then x, = 0 = x,, and x, = x,. Therefore, 
a= x, + x, V1 + w) + x,w V4 =x, — x,w?¥2+ x,wV4e Q(w? V2). 


Also, Q(w? V2) C Es,. Thus, Es, = Q(w? V2). ; 
(c) With a as in (a), o,(a&) = a gives x, = X; = x, = 0. Therefore, Es, = Q(V2). 
(d) As in (a), we can show that if 


a= Xx, + x, V2 + x3 V4 + xu + x50 V2 + xp V4 © Eg, 


then x, = 0 = x3 = X5 =X; S0@ = X, + x,w E Q(w). Hence, Fs, = Q(w). 


Chapter 17, Section 2 


I. (a) G(K/Q) = (1,0, ,02,03), where 


a,:¥3 + — V3,V5 — V5, 
a,:V3 + —V3,v5 + —N5, 
o,:¥3 — ¥3,V5 + -V5. 


(b) G(Q(a)/Q) = (1,0), where 0: a — a@?. 
(c) x4 —- 3x7+4=0. So 


bt -7_342/-74 (S12 + J 1/2)", 
2 2 2 : 


hence, x = +(V7 + i)/2. Thus, the roots of x* — 3x2 + 4 are 
V77+i -V7-i VI-i —-V7+i 


2 2 2 2 : 
The splitting field is E = Q(V7,i), and (E:Q] = 4. The Galois group obtained ex- 
actly before is the set of automorphisms (1,0, ,02,0;), where 


0; ‘ 7 placed v7, i — l, 
o,:V7-~—-V7, ii, 
03: v7 = v7, =. 
3. Write down a composition series of the Galois group G(Q(u)/Q) by considering Sylow 
2-subgroups. 


Chapter 18, Section 2 


1. Let w be a primitive nth root of unity in F. Then |,w,w2?,...,w"~' are the n distinct roots 
of f(x) = x" — 1 € F[x]. So f(x) does not possess multiple roots in F, and so [(x) = 
nx"—! #0. This implies that either char F = 0 or char F J 7. 
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3. Let char F = p. Suppose x? — b € F[x] is reducible over F. Let a be a root of x° — bin 

its splitting field E. Then x° — b = (x — a)?. Thus, by hypothesis, some factor (x — a)’ 

of (x — a)” belongs to F[x]. This implies — ra € F, yielding a € F. Therefore, E = F. 

Now let char F ¥ p. Ifaisa root of x° — binits splitting field E, then a,aq,...,aw?! 

are the p roots of x? — b, where w is a primitive pth root of unity in E. Suppose 

f(x) € F[x] is a factor of x? — b, where degree f(x) = r > 1. The product of the roots of 

f(x) is of the form a = a’w,, w? = 1. Thus, a = a’w, € F,so a? = a’? = b',0<r<p. 

Choose integers s,f such that rs + pt = 1. It then follows that b = 5b?! = g?*b' = 
(a*b')? = B°, say, where £ € F. Therefore, £ is a root of x? — b; so E = F(w). 


Chapter 18, Section 3 


1. (a) F(x) = x* —9x +3 has a real root in the interval (0, 1) and another real root in (1, 2). 
(Use the intermediate value theorem of analysis.) By Descartes’s rule of signs the 
number of positive roots <2, and the number of negative roots =1. So f(x) has 
exactly two nonreal roots. By Theorem 3.6 the Galois group of f(x) is isomorphic 
to §,. Hence, by Theorem 3.2, f(x) is not solvable by radicals. 

(b) Let p(x) = 2x5 — 5x4 + 5. If we put y = 2x, then p(x) transforms to f(y) = (5 — 
Sy4 + 80). Let e(y) = yp — S5y* + 80. Then the roots of f(y) or those of g(y) are 
twice the roots of p(x). So the solvability of p(x) by radicals is equivalent to the 
solvability by radicals of g(y). The purpose of this transformation was to enable us 
to be able to apply Theorem 3.6, which holds for monic irreducible polynomials. It 
is clear, by Eisenstein’s criterion, that g(y) is irreducible over Q. 

Further, by Descartes’s rule of signs, the number of positive (negative) real roots 
is <2 (1). Therefore, the total number of real roots <3. But by the intermediate 
value theorem, g(y) has three roots, one in each of the intervals (— 2,— 1), (2,3), and 
(4,5). Hence, there are exactly three real and two nonreal roots of g(y) (and so of 
p(x)). Thus, by Theorem 3.6 the Galois group of p(x) is S,;. This proves p(x) is not 
solvable by radicals. 

(c) f(x) = x5 — 8x + 6. As explained before, there are at most three real roots. There is 
one real root in (0,1) and one in (1,2). Hence, there are exactly two nonreal roots of 
f(x) in C, which shows that f(x) is not solvable by radicals over Q. 

(d) f(x) = x5 — 4x + 2. By Descartes’s rule of signs the number of real roots s 3. There 
is one real root in the interval (0,1) and one in (1,2). Hence there are exactly two 
nonreal roots of f(x) in C, which shows that f(x) is not solvable by radicals over Q. 


Chapter 18, Section 4 


(a) x} + x3 + x} = 5} — 35,5. +355, where 5, ™ xX, +X_+X3, Sy XyX2F X2X3 F 
XgXy_ Sy MX X2X3. 

(b) Let 5 © x2x3 + x3x3 + x}x}. Then 53 = (x, X2 + x2X3 + X3X,)? = 5 + 2535; that 
is, 5 = $3 — 2535). 


(c) 
(xt + xP} + x03 + x?) = [yt + x} +x} — x9) 


= [Je-20 (where {= x? + x3 + x9) 
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= — (2x2 + x39 + x9) + xh xd + x43 + x¥x3) — x2 xh} 
= — + (sj — 25,5,) — 53 
= (5? — 25s} — 25,55) — Sf. 


(d) 


3 
(x, + x2 X2 + X5KXy + X,) = [](s = x,) 


m 5) — 2(x, + X_ + X53) + Sy(X,X_ + Xp Xy FH XQXy) — XyXQXy 
= $3 — $3 + 5,5) — Sy ™ 5, Sq — Sy. 


Chapter 18, Section 5 


1. Let a= the angle 27/15; that is, a is 24°. But sin 24° = f(V15 + V3 — V10 — 25), 
which is a constructible number by Theorem 5.2. Hence, by Remark 5.9, the angle 
22/15 is constructible that is, it can be trisected by ruler and compass. 

3. cos 18° = V10 + 2 V5/4; so by Theorem 5.2, cos 18° is a constructible number. Hence, 
by Remark 5.9, angle 18° is constructible; that is, angle 54° can be trisected by ruler and 
compass. 

5. (i), (iii), and (v). 


Chapter 19, Section 1 
1. Apply Theorem |.1 and Schur’s lemma. 


Chapter 19, Section 2 
1. We use induction on n and Theorem 2.6. Assume the result is true for n = m. Then 


R, © +: OR, © Ras 
(0)© --- OOOR,4; 


so R,@® ::: OR,, and R,,,, noetherian imply, by Theorem 2.6, that R, © --- © 
Ri»+, iS noethenan. 

3. Follows from Theorem 2.3. 

5. See Problem 3 in Section | of Chapter 12. 

7. Assume RcN A = (0). Let 0™ a,c + a,c? + -:- +a,,c™", a,€ A. Then a, + a,c + 

se tac™"'=0; so —a,™a,ct+ +: ta,c™'EANRc=(0). Thus, a, =0. 
Continuing like this, we obtain a, = 0 for all i = 1,...,2. Hence, D9, Ac‘ isa direct sum 
of left ideals Ac‘. This gives an infinite properly ascending chain of left ideals: Ac C 
Ac ® Ac? C Ac @ Ac? © Ac®C °**, a contradiction. Thus, AN Rc * (0). 

9. Any descending chain A, D A, D A; > -:: ofleftideals of the ring R is also a descending 
chain of subspaces of the vector space R over D. Because [R: D] < ™, there must exist a 
positive integer n such that A, = A,,, ™ °°. Hence, R is left artinian. 

11. If possible, let @=%.,A, be the direct sum of left ideals in R. Then O22, 4,D 
@2.,A, ©O2.;A,>D -+: is an infinite properly descending chain of left ideals, a 
contradiction. 


= R, © eee © R,.; 
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Chapter 19, Section 3 


1. By the Wedderburn -Artin theorem R = R, © -:- © R,, where each R, is a matrix 
ring over a division ring. If / isan idealin R, then] = /,@ -+:: © J,, where J, isan ideal 
in R,. This implies J, = (0) or R,, since R, is a simple ring. Hence, R,/J, = R, or (0). 
After renumbering if necessary, we may assume R//] = R, X ++: X R,, where each R, is 
a nonzero matrix ring over a division ring. Clearly, the R,’s do not possess nonzero 
nilpotent ideals. Hence, R// cannot possess nonzero nilpotent ideals. 

3. Let A bea nil ideal. Because R is artinian, A is nilpotent. Thus, NC U. Next, let Bbea 
nilpotent left ideal and suppose B” = 0. Then (B + BR)” C B”R + B” (by induction). 
Hence, B + BR isa nilpotent ideal. Thus, 8+ BRC Vor BC V. This implies UC V. 
Finally, if C is a nilpotent right ideal, then, as before, C + RCis a nilpotent ideal. This 
implies C+ RC is a nil ideal, so CCN. This yields Vc N. Hence, NCUCVEN. 
Therefore, N= U= V. 

4. Hint: R= (ax + by + cla,b,c € Z), subject to x? = 1, xy= y, y? = 2y, and R is not 
artinian. 

5. Note that R is artinian with no nonzero nilpotent ideals; so by the Wedderbum - Artin 
theorem, R = direct sum of matrix rings over division rings D,. That each division ring 
D, is F follows exactly as in the proof of the Maschke theorem. 


Chapter 20, Section 3 
1. (a) 


1 0 0 
01 O 
0 0 10 


is the Smith normal form of the given matrix, which may be obtained by performing 
R,- R;, R, + 3R, C; + 3C;, C; + 2C;, Re + 2R;, R, =P 2R;, C; cs SC,, and so 
on. Thus, rank = 3. 


(b) 


1 0 0 
a 0 
00 (x+ 1x + 3) 


is the Smith normal form of the given matnx. Rank = 3. 
3. (a) Clearly, the two elements are linearly dependent over Z. So rank = |. 
(b) By performing elementary operations on 


23 1 4 
A=/|1 2 3 0 
1 1 1 4 


we can reduce 4 to 


100 0 
0 1 0 0}. 
003 0 


Thus, the rank of the subgroup is 3. 
(c) As explained in (b), we find rank = 3. 
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Chapter 21, Section 3 
(a) Because [1 1] is equivalent to [! 0], the abehan group is isomorphic to Z X Z.1Z >= 
Z. 
(b) Let 


3 -2 0 
A= I 0 1 |. 
Sil Se 


By performing elementary operations, A can be transformed to 


| 0 0 
0 1 O }. 
0 0 3 


Thus, the abelian group is Z/1Z X Z/1Z X Z/3Z = Z/3Z. 
(c) Proceeding as in the preceding problem, we find that the abelian group is Z/2Z. 


Chapter 21, Section 4 


(a) Let A be the given matrix. The invariant factors of A — x/ are 1, 1, (x — 4) 
(x — 1)? = x3 — 6x? + 9x — 4. Thus, the rational canonical form of A is 


00 4 
1 0 -9 |. 
01 6 


(b) If A is the given matrix, then the invariant factors of A — x/ are 1, 1, x? — 9x? + 
14x — 8. Thus, the rational canonical form of A is 


0 0 8 
1 0 -14 |. 
a 9 


(c) The rational canonical form of the given matnx is 


0 0 0 0 

| 0 0 l 

0 1! 0 -3 

0 0 | 3 
Chapter 21, Section 5 


1. (a) 


0 4 2 
A=|-3 8 3]. 
4-8 -2 


On A — x/, we perform (1) R, + Rj, (ii) R, + xR2, (iii) Ry — 4R2z, (iv) C, + xC,.(v) 
Cy + (x — INC), (vi) Ry + 4Qx + 2)Ry, (vin) Cy — 3Cz, (vii) Ry — Rz, (ix) Ry Ry, 
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(x) 1R,, and (xi) — 4R;, successively, and obtain 


] 0 0 
0 (x-2) 0 : 
0 0 (x — 2)? 


The invariant factors of A — x/ are 1,(x — 2),(x — 2)?, and the elementary divisors 
are (x — 2), (x — 2)?. Thus, the Jordan canonical form is 
2 
2 0 
1 2 


(b) Let A be the given matnx. The invariant factors of A — x/ are 1,1,1,(x — 4),(x — 5)°. 
Thus the elementary divisors are (x — 4), (x — 5)3; so the Jordan canonical form is 


4 


o=— AN 
— tro 
moo 


Chapter 22, Section 2 


a a = 
l. (i) -@é=—@8C=0. 
0) b 8b 
(ii) Choose integers a, b such that 6a + 7b = 1. Then 
X@ jp = X(6a + 7b)@ y = X6a@j+ x7 @yp=0+ xb@7y=0. 


(iii) Consider the map from Q x Z to Q given by (q,a)++qa which is clearly balanced. 
Thus there exists a homomorphism from Q@ 7 Z to Q that is invertible. 

(iv) As usual consider the balanced map Z,, x Z,—+Z, where (d,b)++ab. This is well 
defined, since d is the g.c.d. of m and n and induces a homomorphism «:Z,,@7 Z, — Zy. 
We can define a suitable homomorphism £:Z, — Z,,@z Z, which is the inverse of a. 

(v) Use canonical maps (e.g., see Theorem 3.1). 


Chapter 22, Section 3 


1. Consider the balanced map from M x Re to Me given by (m, ae)» mae. This induces a 
homomorphism from M @ Re to Me which is invertible. 


Chapter 22, Section 4 


I. Any element 3);m;@(2,r,;e;) of M @p F can be rewritten as > (Sjmjr,,)@e, = » jx; @e;, 
xjeM. Now M@aF = ©) -5M,,M,= M, under 1x; @e,; > Dx,. Thus 1x, @e,; =0 iff 
each x, = 0, proving uniqueness of representation of elements of M@rF, as desired. 

3. F> @d654j, A,= A. BOF = B@,2jA,=2L,B@,A;= OLB, direct sum of |J| 
copies of B. That (1 @e,),,, is a basis of B@,F is clear (see Problem 1). 

5. In Problem 3,choose A = R, B = R/M, where M is a maximal ideal. R/M 1s a field. Since 
B@®,,F is a vector space over the field B, any two bases have the same cardinality. The 
same is thus true for the free module F (see Problem 3). 
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7. Let O+4ASB4C-—0 be an exact sequence of right R-modules. Consider the induccd 
sequcncc A®,zN £,B@,N EZ, C@,N, where f* = {@1,.2* =2@1,. Since g is onto, 
it immediately follows g* is onto. Also, g*f* =(g@/,)(f@ly)=ef@l, =, since 
gf =. Thus, Im f* c Kerg*. We proceed to show Im f* = kerg”, and thus proving the 
desired result. 

Consider a mapping from C x N>B@,yN/Im f* given by (c.n)>b@n+Im /*, 
where e(b} =c. Since Im f* c Kerg*, the mapping is a well-defined balanced map and 
induces « homomorphism a:C@gN ~B@pN/Im f*. Furthermore, g* induces a 
homomorphism £: B®r N/Im f* + C @g N, whose kernel is Kerg*/Im f*. Observe that 
aff = identity, and so # is | — lt. This gives {m f * > Kerg* and so Im f{* = Kerg’*. 

()}@N nced not be a left exact functor. For 0+ Z+Q — Q/Z — 0 is exact. Hlowever. 
0+Z,@, Z7Z, @,Q is not exact since Z,@,Z7 = Z, and Z,®@,Q=0 


Chapter 22, Section 5 


I. B= M,(K) has a basis {e;,|1 <i, j <n} consisting of matrix units, as an algebra over K. 
A typical element of A @ x B is 3, ,a;;@¢;,.a,€ A Define canonical maps to show that 
A®, M,(K) =~ M,(A). 


3. Let a, b,c, deZ, b #0, d #0. The map ¢ given by (a/b, c/d) + is balanced and induces 
un algebra homomorphism ¢.Q@7Q—-Q, which hus an inverse Wy given by 
Wlajb) = a/b@ 1, for Py(a/b) = a/b and Wo(a/b @c/d) = Wlac/bd) = ac/bd@ 1 = 2° 
a ae ~ ae c . 

d 


bd db 
5 Let (e,),, be a basis of E over F. Thus p(x) =>" , x'a,eE[x] can be rewritten as 
p(x} = peep Ben x'a,,ej, &,€F, and where all but finitcly many «,,, jEA, are zero. This 
gives P(x) = Dek pj(x}e;, p(x)eF Lx]. 
Now the map (f(x),a)—~f(x)a of F[x] x E-E[x] is balanced and induces a 


homomorphism F(x] @_— E E(x], and its inverse is given by p(x) > Djp,() @e,. 


mh WN — 
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Abel, N.H. (1802-1829), 352 
abelian groups, 63, 159, 246 
Cauchy's theorem for, 146 
fundamental theorem of finitely 
generated, 141, 408 
invariants of finite, 144 
of order n upto isomorphism, 
144, 145 
type of, 144 
action of groups, 107 
by conjugation, 108, 112 
by translation, 108 
addition or sum 
of complex numbers, 36, 185 
of matrices, see matrices 
of natural numbers and integers, 
234, 240 
algebra, 170 
group, 177 
quaternion, 177 
of linear mappings, 269 
algebraic 
closure, 293, 295 
element, 289 
extension, 290 
number, 299 
Structure, 61 
system, 61 
algebraically closed field, 295 
alternating group A,, 134 
of degree n, 134 
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simplicity of, 135 
annihilator 

left and right, 195, 374 
antihomomorphism, 192 
antiisomorphism, 192 


Artin, E.(1898 1962), 297, 369-71, 386 


artinian 
mudule, 369, 370 
ring, 371, 386 
ascending chain condition (acc) 
for modules, 369 
for rings, 370 
associates, 212 
associative law, 22 
generalized, 24, 162 
automorphism 
of field, 293, 305, 314 
of group, 71, 104 
inner, 71, 104 
axiom of induction, 233 


binary operation 
associative, 22 
commutative, 22 
distributive, 22 
pointwise, 44 
Boole, G. (1815 1864), see Boolean 
ring 
Boolean ring, 168, 207 
Burnside, W. (1852~—1927) 
theorem, 114 
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cancellation law, 64, 237 
canonical form 
generalized Jordan, 423-4 
Jordan, 424 
rational, 417 
canonical or natural 
homomorphisms, 92, 189 
mapping, 18 
cardinal number, 25 
cartesian product 
of groups, 67 
of mappings, 19 
of rings and modules, 171, 248 
of sets, 9 
category, 426 
Cauchy, A.L. (1789-1857) 
theorem, 146, 147 
Cayley, A. (1821-1895) 
theorem, 86, 109 
center 
of dihedral group, 89 
of groups, 73 
of p-groups, 113 
of rings, 169 
of symmetric group, 89 
centralizer, 82 
characteristic 
of fields, 170 
function, 18 
polynomial, 414 
of rings, 170 
roots, 399, 414 
chain, 12 
chain conditions 
ascending, 369 
descending, 369 
chain of partially ordered sets, 211 
Chinese remainder theorem, 202, 
260 
class equation (formula), 113 
column 
matrix, 40 
of matrix, 40 
module, 393 
rank, 394 
space, 276 
vector, 40 
commutative 
binary operation, 21 
diagram, 16, 259, 429 


group, 63 
ring, 160 
commutator, 93 
subgroup, 93 
companion matrix, 45 
complex numbers, 35, 185 
component of ordered pair, 9 
componentwise binary operations, 
44, 171 
composition of mappings, 15 
composition series 
composition factor of, 120 
of groups and modules, 120, 387 
congruence 
modulo ideal, 184 
modulo n on Z, 10 
conjugate 
class of element of group, 111 
class of subset of group, 113 
classes of S,, 131 
of subset, 113 
constructible 
angle, 361 
circle, line, point, 359 
n-gon, 363 
number, 359 
with ruler and compass, 358 
content of polynomial, 221 
countable set, 26 
crossed homomorphism, 344 
cycles, see permutations 
cyclic groups, 73, 82 
converse of Lagrange’s theorem for, 
83 
generators of, 84 
of same order, 83 
subgroup of, 83 
cyclic endomorphism, 414 
cyclic extension of field, 344 
T-cyclic subspace, 414 
cyclic module, 250 
cyclotomic polynomial, 341 


Dedekind, R. (1831-1916) 
lemma, 323 

degree 
of dihedral group, 87 
of extension of field, 285 
of polynomial, 220 
of symmetric group, 84 
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DeMorgan, A. (1806-1871) 
rules, 6 
denumerable set, see countable set 
derivative, 307 
descending chain condition (dcc) 
for modules and rings, 369 
determinant, 49 
cofactor of element of, 53 
elementary (row, columm) 
Operations, 394 
expansion according to ith row, 54 
expansion according to jth column, 
54 
over commutative ring, 49 
product theorem of, 52 
properties, 49, 50, 51 
diamond isomorphism theorem, 98 
direct product 
of groups, 139 
of modules, 248 
of rings, 171 
direct sum 
of groups, 139 
of ideals, 196 
of modules, 248 
of rings, 171 
direct summand, 251 
disjoint 
cycles, 85, 129 
permutations, 85, 129 
sets, 5 
distributive law, 22, 159 
division algoritm 
for euclidean domain, 217 
for integers, 32 
for poynomials, 220 
divisor, 30, 212 
greatest common (g.c.d), 33, 215 
zero, 164 
domain 
embeddable in division ring (field), 
226, 230 
euclidean, 217 
integral, 163, 212 
of mapping or function, see mapping 
principal ideal, 183, 211, 216, 218, 
397, 402 
unique factorization, 
213-16, 219-23 
double quotient isomorphism 
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theorem, 99 
duplicating cube, 362 


Eisenstein, F.G.M. (1823-1852) 

criterion, 284 
element, 3 

algebraic, 289 

identity (left, right), 61, 62, 159 

inverse (left, right), 62 

invertible, 62, 172, 175 

irreducible, 212 

maximal, 211 

minimal polynomial of, 290 

order of, 75 

power of, 24 

prime, 30, 212 

regular, 224 

separable, 316 

torsion, 404 

torsion-free, 404 

unit, 172, 175 

unity, 62, 160 

von Neumann regular, 176 
elementary divisors, see matrices 
elementary operations, see matrices 
elementary symmetric functions, 356 
embeddable, see monomorphism 
embedding, 188 

of field in algebraically closed field, 

296 

of integral domain in field, 226 

of ring in ring with 1, 193 
endomorphisms 

of abelian group, 167 

of group, 70 

of module, 254, 257 
epimorphism, 70 
equal 

mappings, 14 

marices, 39 

sets, 3 
equivalence 

class, 12 

of matrices, 397 

relation, 10 
euclidean domain, see domain 
Euler, L. (1707-1783) 

function, 35 

~Fermat theorem, 35, 79, 174 
exact sequence, 259 
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extension 
algebraic, 290 
cyclic, 344 
degree of, 285 
of field, 285 
finite, 285 
finitely generated, 292 
Galois, 330 
infinite, 285 
normal, 305, 326-8 
separable, 316, 318, 319, 326 
simple, 317 
transcendental, 290 
external direct product (sum) 
of groups, 139 
of rings, 171, 198 


factor, 30 
module, see quotient 
of normal series, see normal series 
faithful 
module, 252 
representation of group, 109 
family, 20 
Fermat, P. de (1601-1665) 
theorem, 35 
fields, 37, 163 
of algebraic numbers, 299 
characteristic of, 170 
finite, 310-15 
finite extension of, 285 
finitely generated extension of, 292 
fixed (of group of automorphisms), 
323 
of fractions, 226 
Galois, 311 
Galois extension of, 330 
infinite extension of, 285 
normal extension of, 305, 326-8 
perfect, 316 
prime, 310 
radical extension of, 348 
separable element of, 316 
separable extension of, 316, 318, 
319, 326 
separable polynomial over, 316 
simple extension of, 317 
splitting, 300 
transcendental extension of, 290 
first isomorphism theorem, 97 


first principle of induction 31, 233 
Frobenius, G. (1849-1917) 
endomorphism 314, 344 
function, see mapping 
functor, 427 
contravariant, 427 
covariant, 427 
left exact, 436 
right exact, 436 
fundamental theorem 
of algebra, 299, 338 
of arithmetic, 31 
of finitely generated abelian group, 
141, 408 
of finitely generated modules, 402 
of Galois theory, 330 
of homomorphism for groups, 97 
of homomorphism for modules, 256 
of homomorphism for rings, 190 


G-sets, see action of groups 
Galois, E. (1811-1832), 352 
extension of field, 330 
field, 311 
group, 330, 350 
Gauss, C.F. (1777-1855) 
lemma, 221, 283 
Gaussian integers, 174, 217 
generalized Jordan blocks, 423 
greatest common divisor, see divisor 
groups, 62 
action of, 107 
algebra, 177 
automorphisms of, 70, 104 
centralizer of subset of, 82 
class decomposition of, 113 
class equation formula of, 114 
class of nilpotency of, 126 
commutative, 63 
commutator of, 93 
commutator subgroup of, 93 
complete, 105 
composition factors of, 120 
composition series of, 120—3 
conjugation action of, 108, 110, 112 
correspondence theorem for 
subgroups of, 100 
cyclic, 73, 82-3 
defining relations of, 86, 90 
degree of symmetric, 84 


Index 


derived, 93 

dihedral, 87 

direct product of, 139 

endomorphisms of, 70 

endomorphisms of abelian, 167 

of even permutations A,, 133 

faithful representation of, 109 

finitely generated abelian, 141, 408 

finitely generated, 75 

Galois, 330, 350 

generators and relations of, 
86, 90 

homomorphic image of, 70 

homomorphism of 69, 91 

identity of, 62 

inner automorphism of, see 
automorphism 

isomorphic, 70 

isomorphism of, 70 

isotropy, 110 

Klein four, 69, 89 

left (right) regular representation of, 
87 

linear, 66 

monomorphism of, 70 

nilpotent, 126-8 

nonsolvability of symmetric, 135 

normalizer of element (or subset) 
of, 92, 110 

nth center of, 126 

nth derived, 124 

nth power of element of, 63 

octic, 89, 95 

orbit decomposition of, 112 

orbit of element of, 111 

order of, 64 

of order eight, 152 

order of element of, 75 

of order p?, pq, 152 

permutation, 84, 86 

of prime power order, 113, 126, 144 

product of subsets of, 91 

of quasi-regular elements in ring, 
175, 176 

quaternion, 69, 95 

quotient, 92 

simple, 101 

solvable, 124-6, 351 

solvability of prime power, 126, 127 

stabilizer of subset of, 110 
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Sylow p-subgroup of, 146 
Sylow theorems for, 147, 148 
symmetric, 84, 87 

of symmetries, 65, 87--9 
transitive permutation, 353 
trivial subgroup of, 72 

of units in ring, 175, 176 


Hilbert, D. (1862 -1943) 
basis theorem, 375 
problem 90, 345 
Holder, O. (1859-1937), 121, 387 
homomorphisms 
fundamental theorems, 97, 190, 256 
Kernel and image of, 70, 71, 188, 254 
split, 268 
homomorphic image, 70, 188, 254 


idealizer, 195 
ideals, 179 
annihilator, 195, 374 
comaximal, 203 
correspondence theorem for, 191 
direct sum of, 196 
finitely generated, 182 
large, 381 
maximal, 192, 203 
minimal, 202 
nil, 210 
nilpotent, 209 
order, 420 
primary, 219 
prime, 206 
product of, 205 
radical of primary, 219 
right, left, 179 
sum of, 196 
trivial, 179 
idempotent element, 170 
identity 
endomorphism, 255 
of group, 62 
mapping, 14 
matrix, 44 
of ring, 160 
image 
of group, ring, module, 70, 188, 254 
of element, 14 
of mapping, 15 
improper divisors, 212 
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indeterminates, 166, 297 
indexed family, 20 
integers, 30, 240-5 
modulo n under addition, 64 
modulo n under multiplication, 64 
inverse (left, right), see, element 
image, 20 
invertible element, 62, 172 
irreducible 
element, 212 
polynomial, 28 
isomorphism, 70, 188, 254 
isotropy, see stabilizer 


Jordan, C. (1838-1922) 
~—Holder theorem, 121, 387 


Kernel of homomorphisms, 71, 
188, 254 
Kronecker, L. (1823-1891) 
delta, 43 
theorem, 287 


Lasker, E. (1868-1941), 391 
Lagrange, J.L. (1736-1813) 
theorem, 78 

large ideal, see ideals 
lattice, 12 

distributive, 187 

modular, 97 
leading coefficient 

of polynomial, 220 
least common multiple (I.c.m.), 35 
linear combinations of elements, 250 
linear mappings, see mapping 
linear transformations, 254, 268-78 
linearly dependent elements, 263 
linearly independent elements, 263 
list, 20 
lower pound, 11 

greatest, 11 


map, see mapping 
mapping or function, 14 
balanced, 428 
biadditive, 428 
bijective, 16 
canonical (natural), 19 
cartesian product of, 19 


codomain, 14 

composite (composition), 15 

domain of, 14 

graph of, 14 

identity, 14 

image of, 14 

inclusion, 14 

induced, 18 

injective, see one-to-one 

insertion, see inclusion 

insertion of, 14 

inverse image of set under, 19, 
100, 191 

invertible, 17 

left inverse of, 18 

linear, 254 

one-to-one, 16 

onto, 16 

preimage of element under, 14 

proiection, 19 

range of, see image of 

restriction of, 18 

right inverse of, 18 

Surjective, 16 


Maschke, H. (1853~1908) 


theorem, 385 


matrices, 39 


block multiplication, 47 
blocks of, 46 

characteristic polynomial of, 414 
characterisitc roots of, 414 
column, 40 

column of, 40 

column space of, 276 
companion, 415 

diagonal, 40 

diagonal of, 40 

dot product of, 42 
eigenvalues of, 414 
elementary (operations), 396 
elementary divisors of, 418, 424 
entry, 39 

equal, 39 

equivalent, 397 

identity, 44 

invariants of, 399 

inverse of, 44 

invertible, 44 

Jordan blocks of, 423 
Jordan-canonical form of, 424 
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of linear mappings, 268-73 
matrix units, 45 
multiplication (product) of, 42 
negative of, 41 
partitioned, 46 
rank of, 273, 400 
rank-nullity theorem of, 276 
rational canonical form of, 417 
rectangular, 40 
ring of, 164 
row, 40 
row of, 40 
row rank of, 274 
row rank of, 274 
scalar, 40 
scalar multiplication, 41 
size, 39 
Smith normal form of, 397 
square, 40 
strictly upper (lower) triangular, 40 
submatrix of, 46 
sum of, 41 
of transformation from one basis to 
another, 269 
transpose of, 45 
upper (lower) triangular, 40 
zero (null), 41 
maximal 
element, see element 
ideal, see ideals 
normal subgroup, see subgroups 
minimal 
ideal, see ideals 
polynomial, 290 
modular law, 252 
modules, 246 
artinian, 369—70 
column, 393 
completely reducible, 262 
composition series of, 387 
cyclic, 250 
direct sum of, 248 
direct sum of submodules of, 251 
endomorphisms of, 254, 257 
factor, see quotient 
faithful, 252 
finitely cogenerated, 368 
finitely generated, 250, 402 
free, 264 
homomorphic image of, 254 
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homomorphism (R-homomorphism) 
of, 254 
irreducible, 260 
isomorphic, 254 
isomorphism (R-isomorphism) of, 
254 
left and right, 246 
linear (R-linear) mapping of, 254 
noetherian, 369-70 
primary 389 
quotient, 255-6 
rank of free, 266 
row, 393 
simple, 260 
standard basis of, 264 
subisomorphic, 389 
submodule of (R-submodule), 
248, 250 
sum of submodules of, 250 
trivial submodules of, 248 
uniform, 388 
monic Polynomial, 220, 282 
monomorphism 
of groups, 70 
of rings, 188 
multiple, 30, 212 
roots, 307-10 
multiplication or product 
of complex numbers, 36, 185 
of matrices, see matrices 
of natural numbers and integers, 
239, 240 
multiplicative set, 224 
multiplicity of roots, see multiple roots 
multiplication table, 67 


n-ary Operation 21 
n-tuple, 20 
natural numbers, 30 
negative integers, 242 
Neumann, J. von (1903-1957) 
regular element, see element 
nilpotent elements, 170 
Noether, E. (1882-1935), 369-81, 391 
~Lasker theorem, 391 
noetherian 
modules, 369 
rings, 371 
norm of element, 348 
normal series, 120 


484 Index 


normal (contd.) 
equivalent, 121 
factors of, 120 
normalizer, 92, 110 
nth center of group, 126 
nth derived group, 124 
nth root of unity, 71, 340 


one-to-one mapping, see mapping 
one-to-one correspondence, 17 
between conjugate classes of S, and 
partitions of n, 131 
between family of nonisomorphic 
abelian groups of order p* and 
partitions of e, 144 
between ideals of ring and its 
quotient ring, 191 
between linear mappings and 
matrices, 269 
between subgroups of Galois group 
and subfields of field, 330 
between subgroups of group and its 
quotient group, 100 
orbits, 111 
order 
of element, 75 
of group, 64 
ideal, 415 
ordered 
basis, 268-71 
domains, 243, 244 
pair, 9 
ring, 243, 244 
set, 20 
Ore, O. (1899-1968) 
condition, 228 
domain, 228 
orthogonal family 
of idempotents, 201 
Osofsky, B., (1937--), 391 


partial order, 10 
partially ordered set (poset), 11, 211 
partition into quotient set, 12 
peano, G. (1858-1932) 

axioms, 233 
permutations 

cycle (r-cycle), 48, 85 

cycle structure of, 130 

cyclic decomposition of, 129 


disjoint, 85, 129 
even, odd, 49, 133 
p-group, see groups of prime power 


order 
Poincare, H. (1854-1912) 
theorem, 79 
pointwise operation, see binary 
operation 


polynomials, 165, 183, 219 
content of, 221 
cyclotomic, 341 
degree of, 220 
Galois group of, 330-9 
primitive, 221, 282 
reducible, 281-3 
root (zero) of, 282 
separable, 316 
solvable by radicals, 348-55 
symmetric, 355 
poset, see partially ordered set 
positive integers, 242, 245 
power of element, see element 
primitive polynomial see 
polynomials 
primitive root, 340-7 
principle of mathematical induction, 
233 -4 
projection, 19, 255 


quasi-regular element, 175 
quaternion algebra, see algebra 
quaternion group, see groups 
quotient, 32 
groups, modules, rings and sets, see 
under respective headings 


r-cycle, see permutations 
R-homomorphism (R-linear), see 
modules 

radical extension of fields, 348-9 
rank-nullity theorem, 276 
rational cononical form see matricés 
recursion theorem, 33 
regular element in ring, see element 
regular multiplicative set, 224 
relation, 9 

antisymmetric, 10 

congruence, 10 

conjugacy, 112, 113 

equivalence, 10 
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partition of set under equivalence, |2 


quotient set of set under 
equivalence, 12 
reflexive, 10 
symmetric, 10 
transitive, 10 
relatively prime elements, 215 
remainder in division algorithm, 32 
representations 
faithful, 109 
of linear mappings, 268-72 
(left) regular, 87 
right noetherian ring but not left 
noetherian, 380 
right (left) quotient ring, 230-2 
restriction of mapping, see mapping 
rings, 159 
additive group of, 159 
antihomomorphism, 192 
antiisomorphism, 192 
artinian, 371 -5, 382 -8 
artinian but not noetherian, 372 
Boolean, 168, 207 
center of, 169 
characteristic of, 170 
commutative, 160 
component, 171 
direct product of, 171 
direct sum of, 171 
division, 163 
embeddable in field (division ring), 
226, 230 
external direct sum of, 198 
factor, see quotient 
of formal Laurent series, 177 
of formal power series, 176 
of fractions, 225 
of Gaussian integers, 174, 217 
generalized associative law of, 162 
homomorphic image of, 188 
homomorphism of, 188 
idempotent clement, 170 
identity (unity) of, 160 
of integers, 159, 233, 241 
of integers mod n, 160 
invertible element, 172, 173 
isomorphic, 188 
isomorphism of, 188 
left artintan but not right artinian, 
385 
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local, 195, 226 

of matrices, 164 

monomorphism of, 188 

multiplicative set in, 224 

von Neumann regular element in, 
176 

nilpotent element tn, 170 

noetherian, 371-81 

noncommutative, 160 

opposite, 193 

ordered, 244 

polynomial 165, 219 

prime, 209 

prime subring of, 174 

quaternion, 177 

quotient, 184, 225, 226 

of quotients, 230 

of real quaternions, 163 

of real-valued continuous func- 
tions, 160, 173, 208 

regular elements in, 224 

right noetherian but not left 
noetherian, 380 

semisimple artinian, 386-8 

simple, 204 

of subsets of set. 168 

total quotient, 225 

trivial, 160 

units of, 175, 212 

unity (identity) of, 160 

with Ore-condition, 228 

without | embedded 1n ring with 1, 
193 

without zero divisors, 163-4, 212-23 

zero divisors in, 164 

zero element of, 159 

root (zero) of polynomial, 282 
ruler and compass constructions, 358 


Schur, 1. (1875 1941) 

lemma, 262 
second and third Sylow theorems, 

148 9 

second isomorphism theorem, 98 
second principle of induction, 31, 239 
semidirect product, 97 
semigroup, 61 
semisimple artinian ring, 386 8 
separable element, see element 
separable extension, see fields 
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separable polynomial, see polynomials 
sequence, 20 
series 
composition, 120-2 
normal, 120, 125 
upper central, 126 
sets, 3 
complement of, 5 
difference of, 5 
disjoint, 5 
empty, 4 
equal, 3 
equipotent, 25 
family of, 20 
finite, 4 
index, 20 
infinite, 4 
intersection of, 5 
null, 4 
power, 7 
proper subset of, 4 
quotient, 12 
subset of, 4 
union of, 4 
universal, 5 
void, 4 
simple extension, see estension 
simple root, 307 
simplicity of A,, 135 
Smith normal form, see matrices 
solvability of equation by radicals, 
348-52 
solvable Galois group, 350-2 
solvable group, see groups, 
split extension, 97 
splitting field, see fields 
squaring circle, 362 
stabilizer, see groups 
standard basis, 264 
subdivision ring, 168 
subfield, 168 
subgroups, 72 
correspondence theorem for 
subgroups, 100 
cosets of, 77 
cyclic, 73 
generated by subset, 75 
index of, 78 
internal direct product of, 139 
maximal normal, 101 


normal, 91 
proper, 72 
Sylow p-subgroup, 146 
trivial, 72 
submodules, 248 
cyclic, 250 
direct sum of, 251-2 
generated by subset, 250 
sum of, 250-1 
trivial, 248 
subring, 168 
generated by subset, 169 
prime, 174 
subset, 4 
subspace, 248 
successor of element, 233 
successor map, 233 
surjective mapping, see mapping 
Sylow, L. (1832-1918) 
p-subgroup, see subgroups 
theorems, 147-9 
symmetric function, 355 
symmetries of geometric figure, 65, 87 
symmetry (distance preserving 
permutation), 87 


tensor product, 428 
of modules, 428, 431 
of algebras, 436 
third isomorphism theorem, 99 
torsion element, see element 
torsion-free element, see element 
total quotient ring, 225 
totally ordered set (chain), 12 
trace of element, 348 
transcendental extenstions, see 
extension 
transfinite cardinal number, 26 
transitive permutation group, see 
groups 
translation, see action of groups 
transposition, 85 
trichotomy law of integers, 242, 243 
trichotomy law of natural numbers, 
237 
trisecting an angle, 363 
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unique factorization domain, see 
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This text on abstract algebra for undergraduate students is self-contained 
and gives complete and comprehensive coverage of the topics usually 
taught at this level, providing flexibility in the selection of topics to be 
taught in individual classes. All the topics are discussed in a direct and 
detailed manner. 

The book is divided into five parts. The first part contains all the fun- 
damental information necessary for using the remainder of the book: an 
informal introduction to sets, number systems, matrices, and determinants. 
The second part deals with groups. Starting with the group axioms, the 
topics discussed include G-sets, the Jordan—Hélder theorem, simplicity 
of A, and nonsolvability of S,,, n>4, the fundamental theorem of finitely 
generated abelian groups, and Sylow theorems. The third part treats 
rings and modules, including prime and maximal ideals, UFD, PID, 
rings of fractions of a commutative ring with respect to a multiplicative 
set, systematic development of integers from Peano’s axioms, vector 
spaces, completely reducible modules, and rank. The fourth part is con- 
cerned with field theory: algebraic extensions of a field, algebraically 
closed fields, normal and separable extensions, and the fundamental 
theorem of Galois theory. All these are treated with numerous examples 
worked out. Much of the material in parts I, III, and IV forms the core 
syllabus of a course in abstract algebra. The fifth part goes on to treat 
some additional topics not usually taught at the undergraduate level, 
such as the Wedderburn—Artin theorem for semisimple artinian rings, 
the Noether-Lasker theorem, the Smith normal form over a PID, finitely 
generated modules over a PID and their applications to rational and 
Jordan canonical forms, and tensor products of modules. 


Throughout the text, complete proofs have been given for all theorems 
without glossing over significant details or leaving important theorems 
as exercises. In addition, the book contains many examples fully worked 
out and a variety of problems for practice and challenge. Solutions to the 
odd-numbered problems are provided at the end of the book. 

This new edition contains an introduction to catagories and functors, 
a new chapter on tensor products, and a discussion of the new (1993) 
approach to the celebrated Noether-Lasker theorem. In addition, there 
are over 150 new problems and examples particularly aimed at relating 
abstract concepts to concrete situations. 


“A thorough and surprising clean-cut survey of the group/ring/ field 
troika which manages to convey the idea of algebra as a unified enterprise.” 
New Scientist 


